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Secreted cc-Amylase As A Reporter Gene 



The present invention relates to a secreted reporter gene system. More 
specifically, the present invention relates to a secreted reporter gene system based on 
5 one or more a-amylases, which are electrophoretically distinct variants of each other, 
for simultaneous expression in mammalian cells. 

BACKGROUND OF THE INVENTION 

Reporter genes come in two basic varieties; those that encode intracellular 
10 enzymes, and those that code for secreted reporter proteins. Most reporter genes 
currently in use are of the intracellular variety that require manipulation of the cells, 
normally harvesting and lysis, prior to measurement of the reporter activity. In 
contrast, cells transformed with constructs that encode secreted reporter enzymes do 
not require manipulation or lysis of the cells for activity measurements. Instead, 
15 aliquots of the extracellular medium are harvested and assayed directly for enzyme 
activity. Thus, secreted reporter genes can be used to monitor gene expression in 
vitro, in vivo and ex vivo. 

There are a number of commercially available reporter genes that encode 
20 intracellular enzymes, e.g. green fluorescent protein (GFP; Chalfie et al., 1994, 
Science 263: 802-805 ), luciferase (de Wet et aL, 1987, Mol. Cell. Biol. 7: 725-737), 
chloramphenicol acetyltransferase (CAT; Gorman et al. 9 1982, Mol. Cell. Biol. 2: 
1044-1051). However, none of these intracellular enzymes are suitable as a secreted 
reporter activity. For instance, the luciferase reporter gene is used widely due mainly 
25 to the sensitivity of the detection system and the lack of background activity in most 
biological systems. However, the luciferase enzyme is relatively unstable under any 
conditions, and is completely and irreversibly inactivated by the process of secretion. 

Secretable human placental alkaline phosphatase (SEAP); a truncated form of 
30 membrane-bound alkaline phosphatase; (Berger, J., Hauber, J., Hauber, R., Geiger, 
R., Cullen, B.R. 1988. Gene. 66: 1-10; Cullen, B.R., and Malim, M.H. 1992. 
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Methods Enzymol. 216: 362-368) is a reporter gene that is distributed by Clontech as 
a secreted reporter gene/enzyme system (see European Patent application 0 327 960). 
However, many biological systems display endogenous (background) alkaline 
phosphatase (AP) activity. For example, the high level of alkaline phosphatase activity 
in cows' milk precludes the use of SEAP as a reporter activity for experiments aimed 
at using the bovine udder as a "bioreactor" in the production of bioengineered milk. 
A second commercially-available, secreted reporter gene is human growth hormone 
(hGH, Boehringer Mannheim; Selden, R.F., Howie, K., Rowe, M.E., Goodman, H, 
Moore, D. 1986, Mol. Cell. Biol. 6: 3173-3179). However, since hGH has no easily 
measurable biological activity, the detection system for this secreted product uses a 
laborious, indirect antibody-based ELISA method. 

Thus, there is a need for an improved secreted reporter gene that overcomes the 
problems of the prior art systems. Specifically there is a need for a secreted reporter 
gene encoding a biological activity that, unlike hGH, is safe and easily measured, but 
that does not have the background activity problems associated with SEAP. 

SUMMARY OF THE INVENTION 

Thus according to the present invention there is provided a secreted reporter 
gene system for mammalian cells. More specifically the present invention relates to 
a secreted reporter gene system based on one or more variants of a-amylase, that have 
different electrophoretic mobilities from that of mammalian a-amylases, and that can 
be efficiently expressed and secreted by mammalian cells. 

The reporter gene system of the present invention comprises a DNA sequence 
with the following components: 1) a signal peptide coding region; 2) a sequence 
encoding one or more variants of the a-amylase mature protein; and 3) a transcription 
termination region. 



In one embodiment of the present invention, the a-amylase gene is the chicken 
(Gallus gallus) a-amylase gene. 
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In one embodiment of the present invention, the a-amylase gene is the fruit fly 
(Drosophila melanogaster) a-amylase gene. 

In one embodiment of the present invention, the reporter gene system comprises 
5 a DNA sequence based on the a-amylase gene from the chicken (Gallus gallus), in 
conjunction with the a-amylase gene from the fruit fly (Drosophila melanogaster), in a 
dual-reporter plasmid. 

In a further embodiment of the present invention, the signal sequence is based on 
10 the Drosophila melanogaster a-amylase secretion signal. 

In yet a further embodiment of the present invention the transcription termination 
region is based on the Drosophila melanogaster a-amylase gene termination region. 

15 In a further aspect of the present invention the reporter gene system also includes 

a transcription promoter region. 

The present invention is also directed to a vector comprising the reporter gene 

system. 

20 

The present invention also defines a procedure to assay for a-amylase activity in 
complex biological mixtures of proteins. 

In yet a further embodiment of the present invention, there is provided a plate 
25 diffusion assay, which is used to measure a-amylase activity from, a large number of 
samples. 

Also included within the scope of the present invention is a kit including the 
reporter gene system of the present invention and reagents for testing a-amylase activity. 

30 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features of the invention will become more apparent from the 
following description in which reference is made to the appended drawings wherein: 
FIGURE 1 shows the sequence of the chicken a-amylase cDNA (SEQID No: 1). 
5 FIGURE 2 shows an example of the reporter gene construct which does not contain 
a promoter region. The a-amylase mature peptide coding sequence 'amylase', 
and transcription termination region 'tef are indicated. The solid black 
rectangle between the promoter and the amylase coding region represents the 
secretion signal peptide. Restriction enzyme recognition sites are as follows: 
10 As = Ascl ; Hi = //mdlH; Nc = Ncol\ Nh = Nhel; No = Notl. The 8-cutter 

sites Notl and Ascl were built into the upstream and downstream ends, 
respectively, of the reporter gene in order to facilitate its insertion into the 
reporter construct. 

FIGURE 3 shows an example of the reporter gene construct which contains a 
15 promoter region. The description is as above for Figure 2, except the 

transcription promoter 'pro' is also indicated. 
FIGURE 4 shows the sequence of the transcription termination region (SEQID No: 2). 
FIGURE 5 shows the electrophoretic mobilities of a-amylases from 3 different 
sources in a native gel assay. Lane 1, pig pancreatic amylase; Lane 2, D. 
20 melanogaster amylase; Lane 3, chicken pancreatic amylase. 

FIGURE 6 shows the production of the reporter activity by mammalian cells in 
culture. Lane 1, chicken pancreatic amylase control; Lane 2, mouse (PG13) 
cells; Lane 3, mouse (PA317) cells; Lane 4, hamster (CHO) cells; Lane 5, 
monkey (MA104) cells, Lane 6, cow (ET2) cells; Lane 7, human (Hela) cells; 
25 Lane 8, untransformed mouse cells (PG13). 

FIGURE 7 shows the expression levels of amylase under different promoters. Lane 
1, medium from control, untransformed PA317 cells; Lane 2, cells transformed 
with reporter construct containing LTRsamy reporter gene; Lane 3, cells 
transformed with reporter contruct containing Actin:amy reporter gene; Lane 
30 4, cells transformed with reporter construct containing CMV:amy reporter gene; 

Lane 5, chicken amylase standard. 
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FIGURE 8 shows the detection of amylase activity in cows' milk. Lane. 1, whole 
milk; Lane 2, whole milk spiked with chicken amylase; Lane 3, empty; Lane 
4, chicken amylase standard; Lane 5 & 6, milk following removal of caseins; 
Lanes 7 & 8, milk with amylase spike following removal of caseins. 
5 FIGURE 9 shows the isolation and detection of a-amylase in cows' pre-lactation 
secretions (pre-milk). Lanes 1, 2 & 3, early premilk from 3 quarters of cow 
#99. Lane 1, control quarter (RF-); Lane 2, treated quarter (RH + ); Lane 3, 
treated quarter (LH+). Lanes 4, 5 & 6, late premilk from cow #99. Lane 4, 
treated quarter (LH+), Lane 5, treated quarter (RH+); Lane 6, control quarter 
10 (RF-). Lane 7, mixture of pig and chicken amylase standards; Lane 8, chicken 

amylase standard. 

FIGURE 10 is a schematic depiction of the plasmids used in the dual-reporter system. 
"Fly" refers to D. melanogaster; "chick" refers to G. gallus. Abreviations 
are: ss = signal sequence; amy = a-amylase; CMVp = immediate early promoter 

15 of the human cytomegalovirus; LpA=late polyadenylation site; ori = origin of 

replication; TB= transcriptional blocker; actin p=rat 0-actin promoter. 
FIGURE 11 shows the production of a-amylase reporter activities by cultured 
mammalian cells transfected with either the pUC-TEST plasmid or the pU(S- 
CONTROL plasmid. Lane 1, chicken amylase standard; Lane 2, fly amylase 

20 standard; Lane 3, medium from control, untransformed PA317 cells, Lane 4, 

cells transformed with pUC-TEST=base dual reporter construct; Lane 5, cells 
transformed with pUC-CONTROL==dual reporter construct containing the actin 
promoter inserted upstream of the fly amylase reporter gene. 
FIGURE 12 shows the reporter activites of PA317 cells co-transformed with chicken 

25 a-amylase and the firefly luciferase. The reporter genes in each co- 

transformation were placed under the same promoter: the murine retroviral 
LTR, the rat p-actin promoter, or the human CMV early immediate promoter. 
Results for amylase activity were obtained by densitometric scanning of the 
photograph of the native gel containing the amylase activity obtained from the 

30 cell supernatant. Luciferase activity was assayed from cell lysates according 

to manufacturer's instructions (Promega Corp). 
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DESCRIPTION OF PREFERRED EMBODIMENT 

The present invention relates to a secreted reporter gene system. More 
specifically, the present invention relates to a secreted reporter gene system based on 
one or more a-amylases, which are electrophoretically distinct variants of each other, 
and also distinct from mammalian a-amylases, for simultaneous expression in 
mammalian cells. 

An a-amylase reporter gene encodes a biological activity that is easily measured 
by a variety of liquid or semi-solid phase assay systems. The use of a native gel assay 
system eliminates any potential interference by endogenous amylase activities produced 
by the transformed, mammalian target cells. In addition, the advantage of using a 
secreted reporter activity is that it is not necessary to harvest cells from the culture 
dish, or to biopsy a transformed tissue (e.g. udder epithelium), nor is it necessary to 
lyse cells prior to assay. Instead, the cell culture supernatant or biological fluid (e.g. 
milk) is recovered from the transformed culture or animal and assayed for reporter 
activity. This feature has several advantages such as the ability to conduct multiple 
measurements over time on a single transformed population of cells (time course 
measurements). 

Since a-amylase exists in a number of distinct isoforms, an additional 
advantage to using the amylase reporter system is that a different isozyme can be used 
as an internal control in transfection experiments. The "normalizing" isozyme reflects 
a standard level of gene expression, while the "test" isozyme is the indicator enzyme. 
Since the isozymes have different migration patterns on the native gel, they can be 
assayed simultaneously within the same gel lane. This simplifies the assay procedure, 
in contrast to other reporter systems where the test and normalizing activities require 
separate assays. 

The chicken and Drosophila a-amylase enzymes are two examples of suitable 
a-amylases, which can be used alone or together in both embodiments of the present 
invention. These enzymes are compatible with the mammalian cellular translation and 
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secretion machinery. Therefore, the mammalian cells are able to produce and secrete 
them efficiently in an authentic, bioactive form. Furthermore, these a-amylases have 
the same physio-chemical requirements for optimal activity and stability, for example, 
pH optimum, calcium activation, etc., as mammalian amylases. These features are 
5 essential for their use as reporter activities. In addition, the electrophoretic mobilities 
of the mature, active chicken and Drosophila a-amylase enzymes on native 
poly aery lamide gels are different from that of mammalian amylases. Therefore if any 
amylase enzyme is secreted from mammalian cells, the reporter activities can easily 
be differentiated from the background activity. 

10 

The absence of background activity is particularly important when measuring 
reporter activities in biological fluids, such as cows' milk, which contains high 
background levels for other reporter systems, e.g. the SEAP reporter system. 
According to the present invention, the a-amylase reporter system is mostruseful in 
15 complex protein mixtures such as biological fluids, specifically fetal calf serum, milk, 
and pre-lactation secretions, because the reporter a-amylase activity:.: can be 
quantitatively recovered, using the isolation procedure included in this invention, from 
such fluids, which will be described in detail below. 

20 Included within the scope of the present invention are modified DNA sequences 

encoding functionally active a-amylases from chicken and Drosophila, Possible 
modifications include but are not limited to: 1) nucleotide substitutions that eliminate 
restriction enzyme (RE) sites in the naturally-occurring sequence, but do not alter the 
amino acid sequences, and the use of these RE sites in a multiple -cloning-region for 

25 the introduction of DNA components, e.g. the transcription promoter; 2) conservative 
amino acid substitutions; 3) the enhancement of the specific activity of the reporter 
polypeptide by amino acid substitutions; 4) increasing the temperature stability of the 
reporter enzyme by amino acid substitutions; 5) creation of hybrid reporter enzymes 
from portions of genes coding for different isozymes. Also included is the potential 

30 use of amylase inhibitors to specifically inhibit any endogenous, mammalian amylases 
that might be present in the extracellular medium of the transformed cells. 
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The reporter gene system of the present invention also includes a DNA 
sequence encoding a signal peptide coding region, which in one embodiment is based 
on the Drosophila melanogaster a-amylase pre-protein. This signal peptide works 
effectively in combination with the chicken a-amylase enzyme and the mammalian 
cellular secretion machinery. The DNA sequence encoding the signal peptide was 
modified to include useful restriction enzyme sites for the introduction of promoter 
sequences to drive the reporter gene. The present invention also encompasses any 
modifications of the signal peptide region that may increase the efficiency of the 
secretion process beyond the current level. 

The reporter gene system of the present invention also includes a transcription 
termination region. In one embodiment of the present invention the transcription 
terminator region is derived from the Drosophila melanogaster amylase gene. This 
portion of the reporter gene serves the following functions: (1) to provide a 3' non- 
coding region for the transcript; (2) to provide a transcription termination signal for 
the RNA polymerase complex; and (3) to provide a poly-adenylation motif to the 
primary transcript. In the example where two amylase genes are used in the reporter 
gene system and one of said genes is the D. melanogaster a-amylase gene, the 
terminator region comes from the late polyadenylation site of SV40, to minimize 
possible recombination with the a-amylase cassette. 

Included within the scope of the present invention is the addition of any 
mnctional DNA elements, such as transcription enhancer elements, introns, etc., to the 
transcription terminator region in order to enhance the overall level of expression of 
the reporter gene. 

In one embodiment of the present invention, the reporter gene system also 
includes a DNA sequence encoding a transcription promoter region. When the 
reporter gene system of the present invention includes a promoter, the primary use of 
this secreted reporter gene system is to monitor the efficiency of the introduction of 
foreign DNA into mammalian cells in vivo, in vitro and ex vivo. This application 
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facilitates experiments designed to optimize the conditions, methods and vehicles 
employed for the introduction of foreign DNA into mammalian cells. There are a 
number of high expression level, commercially available promoters which are suitable 
for this application. Not to be construed as limiting, possible examples include: a 
Cytomegalovirus promoter, a murine retroviral promoter and a rat beta-actin promoter. 



One aspect of the invention would be the use of a tissue-specific promoter to 
drive the expression of the reporter gene. This configuration would enable the 
researcher to distinguish between expression in target and non-target cell populations. 

10 Furthermore, it would allow researchers to optimize their delivery systems for 
targeting the foreign DNA to cell types that have the greatest capacity for producing 
and secreting foreign peptides. This aspect is especially important for in vivo gene 
transfer. Specific applications would include gene transfer to udder epithelial cells in 
mammals for the production of pharmaceutical agents into the milk (the somatic 

15 bioreactor concept), and DNA transfer for gene therapy in humans. 

The construction of the reporter gene involves the use of standard recombinant 
DNA methods, including the use of restriction enzymes to fuse DNA molecules witfr 
coherent ends, the amplification of DNA fragments using the polymerase chain reaction 

20 (PCR; Canadian patent 1,237,685), and the synthesis of DNA fragments using 
phosphoramidite chemistry (Matteucci and Caruthers, 1981, J. Am. Chem. Soc. 103: 
3185-3191). The application of the reporter gene involves the insertion of the gene 
into a suitable vector to form double-stranded, circular DNA molecules that are 
delivered to the eukaryote cells. General methods for the preparation and modification 

25 of recombinant DNA molecules have been described by Cohen et al. [U.S. patent No. 
4,237,224], Collins et al. [U.S. patent No. 4,304,863], Sambrook et al. (1989, 
Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory), and Mullis 
and Faloona (1987, Methods Enzymol. 155: 335-350) . Because they illustrate much 
of the state of the art, these references are hereby incorporated by reference. 



30 
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Once assembled, the vector carrying the reporter gene is delivered to the target 
cells by one of a number of commonly used methods, e.g. viral vectors, calcium 
phosphate co-precipitation, or liposome-mediated delivery. The DNA plasmid vector 
used to carry the reporter gene can provide for either transient or stable maintenance 
5 of the reporter gene in the target, host cells depending upon the type of vector 
employed. 

The production and secretion of proteins, including reporter enzymes, by 
eukaryotic cells is a multi-stage process. Briefly, the process begins with the 

10 transcription of the gene encoding the secreted product, followed by the processing of 
the primary transcript into mRNA, and the transport of the mature messenger into the 
cytoplasm. The mRNA is translated into the preprotein and simultaneously 
translocated into the first compartment of the secretory pathway by ribosomes 
associated with the endoplasmic reticulum. Targeting of the preprotein to the secretion 

15 pathway is mediated by a secretion signal peptide. Within the endoplasmic reticulum, 
golgi apparatus and the secretory vesicles, the primary translation product undergoes 
sorting and maturation to produce the final bioactive protein which is released into the 
extracellular medium. The present invention provides for novel reporter activities 
based on the chicken a-amylase gene and other electrophoretically distinct variants of 

20 a-amylase, which are compatible with and can be secreted efficiently by mammalian 
cells. The present invention also includes a secretion signal sequence that effectively 
targets the reporter peptide to the secretory machinery of mammalian producer cells. 

There are a number of prior reports on the expression and secretion of a- 
25 amylase in microbial systems. In addition, Japanese Abstract 63263086 described the 
use of a-amylase as an indicator gene in a "promoter trap" construct for use in both 
prokaryote and eukaryote systems. The present system however is preferred, in that 
this system is designed to measure gene expression and DNA delivery specifically in 
mammalian cells using electrophoretically distinguishable variants in combination with 
30 an assay system optimized for use with biological fluids. 
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The present invention also includes an isolation procedure whereby the reporter 
activity can be quantitatively recovered from complex protein mixtures such as 
biological fluids, specifically fetal calf serum, milk, and pre-lactation secretions. The 
isolation protocol, though based on the procedures of Vretblad, P., 1974, FEBS Lett. 
5 47(1): 86-89, Silvanovich, M. P., & Hill, A. D., 1976, Anal. Biochem. 73: 43CM33, 
and Iefuji, H., et al., 1996, Biochem. J. 318: 989-996, was adapted especially for use 
in concentrated protein mixtures. Due to the high protein content, the reporter activity 
cannot be assayed using the original protocols. The present protocol differs from the 
original procedures in that 1) the non-specific proteins in the fluid are removed by 
10 addition of 40% ethanol, and 2) the ethanol-containing solution is applied directly onto 
the immobilized P-cyclodextrin to farther reduce the nonspecific background. 

The present invention also includes a "plate diffusion" assay which is one 
method for measuring the reporter activity when large numbers of samples, for 

15 example, culture supernatants from a population of stably selected clones, must be 
assayed. The method described here is adapted from the observation that colonics-ol 
amylase-producing microorganisms form clearing-zones on starch plates (Iefuji, II., 
et al. 1994. Biosci. Biotechnol. Biochem. 58: 2261-2262). In the amylase reporter 
system, the amylase-containing solutions are applied to wells formed in starch- 

20 containing agarose plates; diffusion of the solution through the starch-agarose forms 
clearing zones whose size correlates with the concentration of amylase in the solution. 

Under conditions where cells are transiently transfected with the reporter 
plasmid, the amount of secreted reporter activity measured in the extracellular medium 

25 of a population of cells, transformed with a multicopy plasmid carrying a reporter gene 
which is driven by a strong, systemic promoter, is determined by two variables: 1 ) the 
number of cells that have received the reporter construct and; 2) the average number 
of copies of the DNA construct that each transformed cell received. Thus, under these 
conditions, the secreted reporter activity provides a measure of the efficiency of the 

30 DNA delivery system, i.e. how many cells in the population were transformed with 
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foreign DNA and how many copies, on average, of the transforming vector were 
incorporated into each transformed cell. 

Under conditions where the cells are stably transfected with the reporter 
plasmid, the level of secreted reporter activity would, in addition, depend on: 1) the 
site at which the reporter gene becomes integrated, 2) the metabolic state of the cells, 
and 3) the environmental conditions that may regulate the cell's metabolism in general, 
and the promoter that drives the reporter gene in particular. If two of the above 
conditions are experimentally controlled, the secreted reporter gene can be used to 
monitor the effects of the third variable in time-course-type experiments. 

The present invention also includes amylase reporter kits. Kits containing a 
single or double amylase gene are included within the scope of the present invention. 

The single reporter kit would include: two DNA constructs; a plate assay 
means; and instructions for use. The single reporter kit would optionally include a 
beta-dextrin purification system, which would be needed in high-protein fluids to allow 
detection of the amylase activity. 

The first DNA construct includes a test plasmid containing a single gene 
coding for a single amylase isozyme, with a multiple cloning region just upstream 
where a test promoter can be inserted. The second DNA construct includes a control 
plasmid where the gene is placed under control of a strong promoter. The plate assay 
detects amylase activity very simply. It consists of : a mould to form the gel plate; a 
buffer used to make the agarose media; detection solutions; and a sample of amylase 
as a positive control. The beta-dextrin system, if included, removes non-specific 
proteins from high-protein fluids to allow detection of the amylase activity. It consists 
of: an immobilized beta-dextrin packed into columns; equilibration buffer; wash 
buffer; and elution buffer. The buffers could be included within the kit or be 
substituted with instructions on the buffer preparation. The kit will also include a 
protocol booklet, which would give detailed procedures, as well as simple steps used 
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to remove background signals from high-protein biological fluids such as serum- 
containing media, pre-lactation secretion, and milk. 



The dual amylase report kit will include: two DNA constructs; a gel 
5 electrophoresis assay means; and instructions for use. The dual reporter kit would 
optionally include a beta-dextrin purification system, which would be needed in high- 
protein fluids to allow detection of the amylase activity. 

In this system the first DNA construct includes a test plasmid (containing the 
10 genes coding for both amylase isozymes, one of which is under a "normalizing" 
promoter). The second DNA construct includes a control plasmid containing the 
isozymes under different promoters. Gel electrophoresis allows unambiguous 
identification of the 2 isozymes. The assay consists of: a sample buffer; gel 
preparation buffer; detection solutions and amylase isozymes as positive controls. As 
15 with the single amylase reporter kit the buffers can be included within the kit or they • 
can be replaced with instructions on the buffer preparation. 

If included, the beta-dextrin system is identical to that contained in the Single 
Reporter Kit and the protocol booklet is identical to that contained in the Single 
20 Reporter Kit. 

The kit of the present invention is much more inexpensive than any expression 
kit presently available. Furthermore, amylase is a very stable enzyme, whose signal 
improves with duration of expression. The dual expression kit allows for 
25 normalization of the expression data. Endogenous levels, as mentioned previously are 
not a problem with this detection system. 

While this invention is described in detail with particular reference to preferred 
embodiments thereof, said embodiments are offered to illustrate but do not limit the 
30 invention. 
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EXAMPLES 

EXAMPLE 1: Chicken a-amylase cDNA 

In order to isolate a chicken genomic a-amylase clone, a commercial chicken 
library in the vector k FIX II (Stratagene) was screened with a murine amylase cDNA 
probe (1.2 kbp Pstl fragment of pMPa21, Hagenbuechle et al., 1980, Cell 21: 179- 
187). Hybridization was carried out under conditions of reduced stringency at 37°C 
in a standard buffer system containing 50% formamide (Benkel and Gavora, 1993, 
Animal Genetics 24: 409-413). Following hybridization, the filters were washed twice 
for 15 min each in 2x SSC, 0.1% SDS at 42°C, followed by a single wash in 0.5x 
SSC, 0.1% SDS at 50°C for 30 min. Autoradiograms were exposed at -70 C with 
intensifying screens. 

Eighteen positive signals were detected following the first round of library 
screening. Ten plaques were chosen at random for second screening, and 4 of these 
isolates were still positive after a third round of screening. One clone (A.A-1) was 
chosen for large-scale DNA isolation and fragment subcloning in preparation for 
sequence analysis. 

A series of overlapping subclones spanning the amylase genomic region was 
constructed by inserting restriction fragments of the primary isolate XA-l into the 
vector pUC18. Double-stranded DNA was sequenced by the gene-walking method 
using synthetic oligonucleotide primers. Primers were synthesized on an Applied 
Biosystems 392 synthesizer and deblocked and desalted before use. Sequencing 
reactions were performed using the dye terminator cycle sequencing kit as described 
in the instructions supplied by the manufacturer (Applied Biosystems Inc.). The 
extension products were analyzed on an Applied Biosystems 373A automated 
sequencer, and sequence assembly was performed using MicroGenie software by 
Beckman. 

The chicken amylase coding region was prepared using reverse transcription- 
polymerase chain reaction technology (RT-PCR). Oligonucleotide primers for the PCR 
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step were designed based on the chicken genomic amylase sequence derived as 
described above. Approximately 2 ^g of total RNA from chicken pancreas was used 
as substrate in a reverse transcription (RT) reaction using the Perkin Elmer RT-PCR 
kit components according to the instructions supplied by the manufacturer. The RT- 
reaction was primed with oligo-dT. The oligonucleotide primers used for the PGR 
stage of the RT-PCR reaction were as follows: (i) Chamy-Nhe (5'- 
AT GCTAGC TCAGTACAATCCCAACACTCAGGCT-3 ' , SEQID No: 3) which spans 
the position in the coding region corresponding to the N-terminus of the mature 
enzyme ; and (ii) Chamy-Hin (5 ' -CG AAGCT1AT A ACTTGGC ATC A 
ACGTGAATTG-3 ' , SEQID No: 4) which spans the stop codon of the amylase coding 
region. The primers were designed to amplify the region in the gene encoding the 
mature oc-amylase peptide. In addition, Charny-Nhe converts the environment of the 
signal peptidase cleavage site into an Nhel site, while Chamy-Hin adds a Hindlll site 
immediately downstream of the stop codon (enzyme recognition sites are underlined). 
The modifications introduced by the PCR primers allow the amplified cDNA to be 
inserted into the reporter construct using cohesive overhang ligation. 

PCR amplification of the chicken amylase cDNA was performed using the LA- 
PCR kit as described in the manufacturer's product bulletin (TaKaRa). The amplified 
fragment was sequenced using PCR-based chain termination technology, prior to 
insertion into the reporter construct - this sequence, SEQ ID No: 1, is shown in Figure 
1. 

EXAMPLE 2: Signal Peptide Coding Region 

The signal peptide used in the current reporter gene is modelled on the sequence 
found in the Amy\ gene of the Oregon-R strain of Drosophila melanogaster (Boer and 
Hickey, 1986, Nuc. Acids Res. 14: 8399-8411). The sequence of the native signal 
peptide extends from nucleotide 1 to 54 in Figure 2 of that reference. The 5 -end of the 
sequence encoding the signal peptide was modified to incorporate an Ncol site straddling 
the translation start codon (ATG) - i.e. the sequence AT ATG T was changed to CC ATG 
G. This modification changes the amino acid immediately downstream of the initiator 
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Met from a Phe to a Val. In addition, the 3'-end of the signal sequence was modified to 
accommodate an Nhel site. Here, the original D. melanogaster sequence of ~ 
Ala-Asn-Ala~ (GCC AAC GCC) was changed to -Ala-Leu-Ala- (GCG CTA GCC). 
The DNA fragment encoding the signal peptide was constructed in the laboratory on a 
5 DNA synthesizer, and assembled into the reporter construct using the restriction enzyme 
sites built into its ends. The novel signal peptide sequence used in the reporter construct 
has the following sequence (SEQID No: 5). 

5 1 -CCATGGTTCTGGCCAAGAGCATAGTGTGCCTCGCCCTCCTGGCGGTGGCGCTAGCT-3 ' 
10 3 1 - GGTACCAAGACCGGTTCTCGTATCACACGGAGCGGGAGGACCGCCACCGCGATCGA- 3 1 

NcoX NheX 



15 EXAMPLE 3: Promoter Region 

In this example the immediate early promoter of the Human cytomegalovirus 
(CMV; Stinski and Roehr, 1985, J. Virology 55: 431-441) was used to drive the 
expression of the reporter gene. 

20 Other examples of promoters that have been used successfully in the reporter 

gene include; a murine retroviral LTR, which is a composite of the MMTV (Mouse 
Mammary Tumor Virus; Ponta et aL 1985, Proc. Natl. Acad. Sci. 82: 1020-1024) and 
the MoMSV (Moloney Murine Sarcoma Virus; Lin et aL 1 990, Proc. Natl. Acad. Sci. 87: 
36-40) LTRs, and the rat beta-actin promoter (Nudel et al. 1983, Nucl. Acids Res., 11, 

25 1759-1771). 

DNA fragments to be used as promoters were amplified by PCR using 
oligonucleotide primers that incorporated useful restriction enzyme sites at the upstream 
{Not!) and downstream (NcoT) ends of the amplified promoter sequences. Restricted 
30 promoters fragments were inserted into the reporter gene construct by cohesive end 
ligation. 
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EXAMPLE 4: Transcription Termination Region 

The final component of the reporter gene is the transcription terminator. In this 
example 567 bp of the Drosophila melanogaster Oregon-R amylase gene termination 
region was used. The sequence of this section (SEQID No: 2) is shown in Figure 4. The 
5 DNA fragment was amplified by PCR using primers that incorporated useful restriction 
enzyme sites. A Hin&lTL site was added just upstream of the TAA (stop codon) of the 
amylase coding region. This TAA is out of frame with the chicken amylase coding 
region, which brings in its own stop codon. The downstream end of the terminator is 
modified into an Ascl site. 

10 

EXAMPLE 5: Drosophila melanogaster a-amylase cDNA 

Plaque lifts of a lambda gtlO library containing cDNAs from RNA of a 
Drosophila melanogaster Oregon-R strain were hybridized to a mouse salivary a- 
amylase cDNA probe (the 1650 bp Pstl restriction fragment of pMSal04; 
15 Hagenbuechle et aL 9 1980, Cell 21: 179-187), under conditions of reduced stringency - 
38 °C in a hybridization buffer containing 50% formamide. Following hybridization, 
filters were washed in 1 X SSC, 0.1% SDS at 52 °C. 

Eight postive plaques were identified from an estimated 20,000 plaques screened. 

20 The longest of these cDNA inserts (OR-M7) was subcloned into a plasmid vector 
(pUC13) and subjected to DNA sequencing by standard methods (Benkel et al., Genome 
29: 510-51 5). The DNA sequence was determined as described in Boer & Hickey, 1 986, 
Nucl. Acids Res. 14: 8399-841 1). The extension products were analyzed on an Applied 
Biosystems 373A automated sequencer, and sequence assembly was performed using 

25 MicroGenie software by Beckman. OR-M7 was found to contain the complete coding 
region of the Drosophila melanogaster a-amylase pre-enzyme. 

The DNA fragment encoding the mature Drosophila melanogaster a-amylase 
enzyme was inserted into expression vectors by PCR amplification using the LA-PCR 
30 kit and synthetic oligonucleotide primers. The oligonucleotide primers used for the 
PCR amplification were as follows: (i) Dm-Ql (5 ' - AT GCT AGC GC A ATTCGAC AC 
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CAACTACGC A-3 ' , SEQID No: 6) which spans the position in the coding region 
corresponding to the N-terminus of the mature enzyme; and Dm3-2 (5*- 
CAAGCTTGTTT AC AACTTGGCGTTG ACGT-3 ' , SEQID No. 7) which spans the 
stop codon of the amylase coding region. The primers were designed to amplify the 
region in the gene encoding the mature a-amylase peptide. In addition, Dm-Ql 
converts the environment of the signal peptidase cleavage site into an Nhel site, while 
Dm3-2 adds a Hindlil site immediately downstream of the stop codon (enzyme 
recognition sites are underlined). The modifications introduced by the PCR primers 
allow the amplified cDNA to be inserted into the reporter construct using cohesive 
overhang ligation. The modified Drosophila a-amylase cDNA is SEQID No. 8. 

EXAMPLE 6: The a-amylase dual-reporter gene system 

Construction of the "test" and "control" plasmids began with the plasmid pIBI- 
B, which was formed from the vector pIBI25 (International Biotechnologies Inc.) by 
techniques of PCR amplification using primers with redundant overhangs (Scharf, 
1990), and standard oligonucleotide linker addition (Sambrook et al., 1989). pIBI-B 
consists of: (i) Drosophila a-amylase promoter; (ii) the Drosophila a-amylase signal 
sequence, as described in Example 2; and (iii) the Drosophila a-amylase termination 
region, as described in Example 4. 

The region spanning the chicken a-amylase cDNA was amplified by long-range 
PCR (LA-PCR kit, TaKaRa), incorporating a Nhel site at the 5' primer and a Hindlil 
site at the 3' primer. This PCR product was ligated into the plasmid pIBI-B via the 
Nhel and HindUI sites to form the plasmid pIBI-fAmy, which contains chicken a- 
amylase joined to Drosophila amylase signal sequence. The CMV promoter was 
amplified by PCR from the plasmid pLNCX (Miller, D.M., & Rosman, G.J., 1989, 
BioTechniques, 7(9): 980-990), then inserted into the plasmid pIBI-fAmy, just 
upstream of the 5' end of the Drosophila signal sequence, replacing the Drosophila 
a-amylase promoter and forming the plasmid pIBI-CfAmy. The entire region 
spanning the CMV promoter to the 3' end of the chicken a-amylase gene was amplified 
by long-range PCR, incorporating unique sites at the 3' and 5' ends. This PCR 
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product was ligated to the 3110bp HindHl-Fsel fragment of the plasmid pSEAP2-Basic 
(Clontech), forming the plasmid pUC-CFAmyHF. The Drosophila a-amylase gene, 
including the signal sequence, was amplified by PCR from the cDNA clone; the PCR 
product was inserted upstream of the chicken a-amylase cassette in the plasmid pUC- 
5 CFAmyHF to form the final plasmid pUC-TEST (SEQID No: 9). This "test" plasmid, 
shown in Figure 10, has the chicken a-amylase gene driven by the CMV promoter. 
The Drosophila a-amylase gene is not under any promoter; instead, there is a 
polylinker sequence 5* of the Drosophila a-amylase gene that allows promoters of 
interest to be inserted. In SEQID No:9 the components are located in the following 

10 positions: Drosophila a-amylase signal sequence: 83-134; Drosophila a-amylase: 
135-1575; Drosophila a-amylase terminator: 1588-2128; CMV promoter: 2129-2951; 
Drosophila a-amylase signal sequence: 2952-2999; chicken amylase: 3000-4505; SV40 
late polyadenylation site: 4506-4754; pUC origin of replication: 5033-5676; 
p-lactamase gene: 6684-582; p-lactamase promoter region: 6754-6685; Fl origin: ; 

15 6816-7272; synthetic polyadenylation site: 7462-7450; and pause site from humari 
a2-globin gene: 7464-7555. 

r 

To construct the plasmid pUC-CONTROL, the rat P-actin promoter was 
inserted between the transcription block and the 5 1 end of the Drosophila a-amylase 

20 gene in the plasmid pUC-TEST. This plasmid (SEQID No: 10), shown schematically 
in Figure 10, has the chicken a-amylase gene driven by the CMV promoter, and the 
Drosophila a-amylase gene driven by the p-actin promoter. In SEQID No: 10 the 
components are located in the following positions: pUG origin of replication: 
5384-6027; p-lactamase gene: 7035-6175; P-lactamase promoter: 7105-7036; Fl 

25 origin: 7167-7623; synthetic polyadenylation site: 7753-7802; and pause site (human 
a2-globin): 7815-7906. 

EXAMPLE 7: The Chicken a-amylase reporter gene system 

A variety of methods are available for the delivery of the reporter gene to the 
30 target host cells including: 1) liposome-mediated fusion; 2) micro-injection; 3) particle 
bombardment; 4) viral vectors; and 5) calcium phosphate co-precipitation. 
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For the purpose of this non-limiting illustrative example, the reporter gene 
linked to the CMV promoter has been inserted into the plasmid pIBI25 (International 
Biotechnologies Incorporated) to produce a transient expression construct. In addition, 
the reporter constructhas been delivered to the cells using the calcium phosphate co- 
5 precipitation method. Other methods of DNA delivery are applicable, as is the use of 
vectors that promote the stable maintenance of the reporter gene within the transformed 
cells, e.g. retroviral vectors. 

EXAMPLE 8: Harvesting of the Reporter Activity from Serum-Free Culture 

10 Supernatants: 

Cells were grown under standard mammalian cell culture conditions. The 
media used to culture the cell lines were as follows: (i) for MA104 and Hela, 
Dulbecco's modified Eagle's medium (DMEM) with 10% Fetal Bovine Serum; (ii) for 
CHO, DMEM with 10% Newborn Calf Serum; (iii) for PA317 and PG13, DMEM 

15 with iron-enriched 10% Calf Serum. Cells were plated at a density of 5xl0 5 cells per 
10 cm dish. Calcium phosphate transfection was carried out overnight. Following 
transfection, cells were incubated overnight in cell line-specific media as described 
above. On day 4, the culture medium was removed and replaced with Serum-free 
DMEM for all cell lines. On day 5, samples of medium were collected, and debris 

20 was removed from the samples by centrifiigation at 500xg. 

Cell supernatant was adjusted to 40% (vol/ vol) of ethanol and the mixture 
incubated on ice for 1 hour. The solution was centrifuged at 16,000xg for 20 minutes 
at 4°C and the supernatant dried under vacuum to remove the ethanol. 100-150/xL of 
25 the resulting solution was used for routine analysis by native gel electrophoresis. To 
concentrate the solution, - lmg of carrier protein (e.g. BSA) was added to the sample 
and the solution applied to a Centricon-30 filtration columns (Amicon). As little as 
0.0005U of amylase processed in this manner can be detected by native gel 
electrophoresis. 

30 
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EXAMPLE 9: Harvesting of the Reporter Activity from Serum-Containing 

Culture Supernatants: 

Cells were grown under standard mammalian cell culture conditions as 
described above, except that 10% serum was included in the culture media. The 
procedure used to harvest the reporter protein is the same as that described for 
harvesting from milk (see below). The presence of serum gives some non-specific 
activity; however, this activity is clearly distinguishable from the reporter activity 
when detected by native gel electrophoresis (described in Example 12) because it has 
a different mobility. For an absolutely clean background and for concentrating the 
sample, adsorption on p-cyclodextrin (described in Example 11) can be used. 

EXAMPLE 10: Harvesting of the Reporter Activity from Milk: 

Two alternative methods are used to recover the a-amylase activity from milk. 
In the first method, fresh milk containing a-amylase was de-fatted by centrifugation at 
8000xg, 10 min. at room temperature. The aqueous fraction was acid-curdled with 1M 
HC1 to a final pH of 4.5, then centriftiged at 8000xg for 20 min. The whey 
(supernatant) fraction was neutralized with 1M NaOH to a final pH of 6.7. A portion 
of the final solution was used for analysis. 

In the second method, the de-fatted milk was adjusted to 40% ethanol, placed 
on ice for l-2h, then centriftiged at 16000xg at 4°C. The supernatant was then placed 
under vacuum to remove the ethanol, and a portion of the final solution was used for 
analysis with essentially complete recovery of the reporter activity. For concentrating 
the sample, adsorption on P-cyclodextrin without removing the ethanol (described in 
Example 11) can be used. 

EXAMPLE 11: Harvesting of the Reporter Activity from Pre-laction 
Secretion (Pre-milk): 

Bovine pre-milk is a jelly-like substance that is so viscous it is nearly solid. To 
analyze a-amylase activity in the pre-milk, the sample was diluted 1/3-1/4 with water, 
then adjusted to 40% ethanol. The sample was placed on ice for l-2h, then centriftiged 
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at 16000xg at 4°C to remove the precipitated proteins. p-cyclodextrin-Sepharose 6B, 
prepared as described in Vretblad, P., 1974, FEBS Lett. 47(1): 86-89, was added to 
the sample at a ratio of 40/xL packed beads to 4mL of prepared sample. The sample 
was shaken for 2h at room temperature, then washed 5 times in 1.5mL of wash buffer 
5 (50mM Tris-HCl, ImM CaCl 2 pH7.5). The sample was eluted by incubation for 5 min 
at room temperature in a desired volume of lOmg/mL p-cyclodextrin. As little as 
0.0002U of a-amylase activity per mL of pre-milk can be recovered this way. 

EXAMPLE 12: Detection of Reporter Activity 

10 Amylase activity can be detected and measured by a variety of simple, safe 

procedures. These can be divided into the tube or liquid assays and the assays that 
involve diffusion in a semi-solid medium or electrophoretic separation. 

The preferred protocol is electrophoretic separation of the proteins in the 
15 sample on a native (non-denaturing) polyacrylamide gel ("native gel electrophoresis"). 
This protocol is described in Benkel and Hickey (1986, Genetics 114: 137-144). 
Briefly, a low percentage acrylamide gel allows proteins to be separated on the basis 
of the overall charge of the molecules. Thus, even though chicken and mammalian 
amylases are very similar in molecular weight, they display very different migration 
20 patterns in the native gel assay (see Figure 5). 

Following electrophoretic separation, the gel is incubated in a buffer solution 
containing partially-hydrolyzed starch. The starch granules coat die gel and penetrate 
the gel surface. Staining of the starch-coated gel with iodine results in a gel that shows 
25 clear amylase bands on a dark blue background. 

The main advantage of the gel assay is that the background activity measured 
for the recipient cells is essentially zero. The gel assay is highly sensitive, and can 
easily be converted into a quantitative format by the incorporation of serial dilutions 
30 of an activity standard (see Benkel and Hickey, 1986, Genetics 114: 943-954). In 
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addition, there is a dye-linked starch-based substrate that facilitates real-time activity 
visualization. 

As an alternative to native gel electrophoresis, prepared samples containing a- 
5 amylase can also be assayed by a very simple "plate diffusion" method. 1% agar, 
made in 50mM Tris-HCl, ImM CaCl 2 pH 7.5, containing 0.02% starch, was poured 
onto culture plates. 4mm-diameter holes were made in the agar, into which up to 30/xL 
of sample can be placed. The plates were covered and incubated overnight at 37 °C to 
allow the amylase to diffuse. "Haloes", formed as the starch becomes digested, were 
10 detected by staining with iodine as described in the native gel electrophoresis 
procedure. 

Chicken amylase in cell culture medium and in milk was tested and the activity 
was found to be stable for days at room temperature in a bioactive format. In addition^ 
15 freezing and thawing does not affect the activity of the enzyme. 

Figure 6 shows a variety of mammalian cell lines transformed with the reporter 
construct containing the CMV promoter delivered using a transient-expression vector*. 
All cell lines tested including; CHO (hamster), ET-2 (cow), Hela (human), MA 104 
20 (monkey), PA317 and PG13 (mouse) showed production and secretion of chicken 
amylase according to the native gel assay. Different cell lines appear to secrete 
chicken amylase at different efficiencies. This may reflect the ability of the cells u> 
take up foreign DNA, or it may be a measure of the inherent capacity of the different 
cell types to secrete proteins. 

25 

Figure 7 shows the results of an experiment in which PA317 cells were co- 
transfected with two reporter constructs: one containing chicken a-amylase as a 
reporter; the other contains firefly luciferase as a well-known intracellular reporter. 
In each co-transfection, both reporters are driven by the same promoter, which is the 
30 murine retroviral LTR, the rat P-actin promoter, or the human CMV early immediate 
promoter (see EXAMPLE 3 above for details of the promoters). The supernatants of 
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transiently transfected cells were harvested and analyzed for reporter activity as 
described in Example 8, while cell pellets were processed and analyzed for luciferase 
activity according to the instructions supplied by the luciferase assay kit manufacturer 
(Promega Corp). The results for extracellular amylase reporter activities show that the 
expression level of the CMV promoter is roughly four times that of the (J-actin 
promoter, which in turn is about twice that of the LTR. Similar results were obtained 
with the intracellular luciferase promoter (Figure 12). 

One of the primary applications for secreted reporter genes is in the 
optimization of DNA delivery and transformation in vivo of mammalian cells that 
secrete proteins in to biological fluids. This optimization process is crucial to the 
development of successful approaches for cell transformation in gene therapy. We are 
currently exploring the use of somatic transgenesis of bovine epithelial tissue in the 
development of 'bioreactor' cows for the production of pharmaceutical agents into 
milk. In order to compare the suitabilities of SEAP (Clontech Cat. No. K2041-1) and 
a-amylase as reporter gene/enzyme systems for the transformation of cow udder cells, 
SEAP and a-amylase were spiked into fresh cows' milk their activities measured. The 
SEAP used to spike the milk came from cell culture supernatant; the a-amylase was 
prepared from chicken pancreas. 

Fresh cows' milk was centrifuged at 8,000 xg for 10 minutes at room 
temperature to remove the fat. SEAP cell supernatant and chicken a-amylase was 
added to the aqueous fraction. For the SEAP reconstitution experiment, SEAP activity 
was measured for the following treatments: (i) untreated milk aqueous phase 
("Endogenous" in Table 1); (ii) milk aqueous phase heated to 65 °C for 30 minutes 
("Heated"); (iii) milk aqueous phase with SEAP spike ("Spiked"); and (iv) milk 
aqueous phase with SEAP spike heated to 65°C for 30 minutes ("Spiked/heated"). 

The results demonstrated that there is a high background of endogenous, soluble 
alkaline phosphatase activity in cows' milk. This activity is indistinguishable from 
SEAP. Heating the samples to 65 °C does not decrease the endogenous SEAP activity 
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level; instead it appears that a portion of the endogenous alkaline phosphatase activity 
is present in an inactive, protein-complexed state in milk, and that heating tends to 
promote the release of some of this protein-bound endogenous SEAP activity. This 
results in a further increase in the background SEAP levels measured in the unspiked 
5 milk. The ability of milk proteins to inactivate SEAP is also evident from the dramatic 
decrease in SEAP signal measured in the spiked samples, where only about 1/4 of the 
SEAP activity added in the spike is detectable following the addition of the milk. 
Heating the spiked samples to 65 °C raises the levels somewhat on average, but this 
increase in activity is at least partially due to the release of endogenous alkaline 
10 phosphatase enzyme. 



TABLE 1: Measurement of SEAP activity in cow's milk. 



Sample 


Endogenous 


Heated 


Spiked 


Spiked/heated 


Cow 1 


52 


60 


282 


307 


Cow 2 


59 


62 


198 


190 



Legend: Cows' milk was processed as described in the text. SEAP activities are given 
in 1000's of luminometer units. "Endogenous" refers to the endogenous background 
activity present in the aqueous phase of the milk; "Heated" refers to the background 
20 activity in the milk following a 30 minute incubation at 65 °C; "Spiked" refers to the 
unheated milk following the addition of the SEAP spike; "Spiked/heated" represents 
the spiked sample following a 30 minute incubation at 65 °C. The SEAP spike consisted 
of 1,075 (xlOOO) units. 

25 Increasing the level of the alkaline phosphatase-inhibitor homoarginine in the 

samples had no effect on the SEAP activity measurements. On the other hand, acid 
curdling of the milk to remove caseins, a standard method used to fractionate milk, 
resulted in the loss of the bulk of the SEAP spike. Likewise, adjusting the milk to 
40% ethanol eliminates the SEAP spike, which precipitated out with the non-specific 

30 proteins. 



Figure 8 shows milk samples with and without the addition of chicken a- 
amylase preparations. For whole milk, the gel assay shows a light smear throughout 
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the entire lane (see lane 1). This is due to the effect of the abundant milk casein 
proteins. However, the band of activity corresponding to the chicken amylase reporter 
'spike' is clearly visible even in whole milk (lane 2). Our standard protocol for the 
amylase reconstitution experiments included an acid curdling step to remove caseins 
from the whole milk. This treatment was incompatible with the SEAP reporter system 
because it almost entirely depleted the milk aqueous phase of alkaline phosphatase 
activity (see above). 

Following neutralization, the whey containing the chicken a-amylase spike was 
analyzed by the standard gel assay. This treatment completely eliminated any 
endogenous background activity from the milk samples, and left the chicken amylase 
band clearly visible in the spiked samples (lanes 7&8). Thus, chicken a-amylase is 
superior to SEAP as a secreted reporter enzyme in biological fluids such as cow's 
milk. 

As outlined above, a major advantage of the amylase reporter system over 
existing reporter assays is the ability to detect chicken amylase activity in complex 
biological fluids. Cow's pre-lactation secretions (also known as pre-milk) are 
particularly problematic biological samples due to their extremely high content of 
proteins, fats and carbohydrates. However, using the simple P-cylodextrin adsorption 
procedure described in Example 11, one can recover the chicken a-amylase reporter 
activity amylase quantitatively even from such recalcitrant materials such as early and 
late pre-lactation secretions of pregnant heifers (see Figure 9). Note the complete lack of 
background activity for chicken amylase in the pre-milk of untreated udder quarters 
(lanes 1 & 6), and the presence of reporter activity only in udder quarters that were trans- 
infected with constructs encoding chicken a-amylase (lanes 2, 3, 4 & 5). In this way, 
as little as 0.0002U of a-amylase activity per mL of pre-milk can be detected. 

Figure 10 gives a schematic representation of the dual reporter constructs used 
for promoter strength determinations in transiently transfected, culture cells. To 
measure the strength of a "test" promoter, the pUC-TEST plasmid is used. It consists 
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of two expression cassettes. The Drosophila a-amylase cassette has the Drosophila a- 
amylase cDNA, including the signal sequence and the terminator region. There is no 
promoter driving this reporter gene; instead, a polylinker region is provided 
immediately 3* of the cDNA to facilitate insertion of a promoter sequence of interest. 
5 Just upstream of this polylinker region is a partially synthetic "transcriptional blocker" 
designed to minimize non-specific transcription of a-amylase. The Drosophila a- 
amylase gene is, therefore, the "test" isozyme. The second cassette has the chicken 
a-amylase cDNA joined at the 5' end with the signal sequence from Drosophila 
melanogaster; the terminator region for this cassette comes from the late 
10 polyadenylation site of SV40, to minimize possible recombination with the Drosophila 
a-amylase cassette. The chicken a-amylase gene is driven by the CMV promoter, and 
therefore, acts as the "normalizing" isozyme. The backbone of the plasmid pUC- 
based. 

15 To provide a control showing that both cassettes are functional, the plasmid 

pUC-CONTROL is used. This plasmid is identical to pUC-TEST, except that the rat 
p-actin promoter was inserted upstream of the Drosophila a-amylase. 

Figure 11 shows the results of transfecting PA317 cells in culture with the dual 
20 reporter construct. Lane 3 shows the lack of background activity in the culture medium 
of untransfected PA317 cells. Lane 4 contains culture medium from PA317 cells 
transfected with the plasmid pUC-TEST. The activity band is specific to the 
normalizing activity resulting from the expression of the chicken amylase reporter gene 
under the control of the CMV promoter. Lane 5 contains culture medium from PA317 
25 cells transfected with the vector pUC-CONTROL. The chicken amylase band results 
from the expression of the normalizing isozyme while the Drosophila melanogaster 
(fly) amylase activity band results from the expression of the fly gene under the control 
of the P-actin promoter. The relative signal intensities of the two activity bands within 
the same gel provide a direct comparison of the transcriptional activities of the CMV 
30 and P-actin promoters. The p-actin promoter was inserted into pUC-CONTROL for 
the purposes of this example only. It will be obvious to persons skilled in the art that 
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the pUC-TEST assay system can be used to assay any nucleotide sequence whether 
natural or synthetic for promoter activity. Furthermore, by employing host cells other 
than PA317 cell line, the assay can be used to evaluate the effect of host cell type on 
promoter activity. In this particular instance, 80/xL of culture supernatant, processed 
as described in Example 8, was used. 

All scientific publications and patent documents are incorporated herein by 
reference. 

The present invention has been described with regard to preferred embodiments. 
However, it will be obvious to persons skilled in the art that a number of variations and 
modifications can be made without departing from the scope of the invention as 
described in the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Her Majesty in Right of Canada as Rep. by 

Agriculture and Agri-Food Canada 

(B) STREET: Central Experimental Farm 

(C) CITY: Ottawa 

(D) STATE: Ontario 

( E ) COUNTRY : Canada 

(F) POSTAL CODE (ZIP) : K1A 0C6 

(A) NAME: The University of Ottawa 

(B) STREET: 115 Seraphin Marion 

(C) CITY: Ottawa 

(D) STATE: Ontario 

( E ) COUNTRY : Canada 

(F) POSTAL CODE (ZIP) : KIN 6N5 

(ii) TITLE OF INVENTION: Secreted alpha-Amylase as a Reporter Gene 
(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: CA 0,000,000 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: CM 2,203,613 

(B) FILING DATE: 24 -APR- 1997 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
GCTAGCTCAG TACAATCCCA ACACTCAGGC TGGGAGGACA TCTATCGTGC ATCTCTTTGA 
ATGGCGCTGG GCCGACATTG CACTGGAGTG C G AAC ACT AT TTAGCTCCTA ATGGGTTTGG 



60 



120 
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AGGAGTTCAG GTTTCTCCTC CAAATGAAAA CATTGTCATT ACTAATCCGA ACAGGCCCTG 180 

GTGGGAAAGA TACCAGCCCA TCAGCTACAA GATCTGCAGT CG AT CGGGC A ATGAAAATGA 240 

ATTCAGAGAC ATGGTGACCA GATGCAACAA TGTTGGAGTT CGTATTTATG TGGATGCTGT 3 00 

TGTCAATCAC ATGTGTGGAT CTATGGGTGG CACGGGCACC CACTCAACAT GTGGGAGCTA 360 

TTTCAACACC GGGACTAGAG ATTTTCCCGC TGTGCCGTAC TCTGCCTGGG ATTTCAATGA 420 

CGGCAAATGT CACACTGCAA GTGGAGACAT CGAAAATTAT GGGGACATGT ATCAGGTCCG 48 0 

GGATTGCAAG TTGTCCAGCC TTCTTGATCT GGCTCTGGAG AAGGACTATG TACGCTCAAC 54 0 

AATTGCAGCG TACATGAATC ACCTCATTGA TATGGGTGTA GCAGGGTTCC GGATCGATGC 60 0 

TGCCAAGCAT ATGTGGCCAG GGGACATAAG AGCGTTTCTG GACAAACTGC ACGATCTAAA 66 0 

TACTCAGTGG TTTTCAGCAG GAACGAAACC CTTTATTTAC CAAGAGGTAA TTGACTTGGG 72 0 

AGGAGAGC C A ATCACAGGCA GTCAGTACTT TGGGAATGGC CGCGTGACAG AATTCAAGTA 78 0 

TGGTGCCAAA CTGGGGACGG TGATCCGGAA GTGGAATGGA GAGAAGATGG CCTACTTAAA 84 0 

GAACTGGGGA GAAGGCTGGG GCTTTGTGCC TTCTGACAGA GCCCTGGTGT TTGTGGATAA 90 0 

CCACGACAAC CAGCGGGGGC ACGGGGCAGG CGGAGCTTCC ATTCTTACTT TCTGGGATGC 96 0 

CAGGCTTTAT AAAATGGCGG TTGGTTTCAT GCTCGCTCAT CCGTACGGGT TCACACGGGT 102 0 

GATGTCAAGT TATCGTTGGC CAAGATATTT CGAAAACGGA GTGGATGTTA ACGACTGGGT 108 0 

GGGACCACCA AGTAACTCGG ACGGATCGAC GAAGTCCGTT ACAATCAACG CAGACACTAC 114 o 

CTGTGGCAAT GACTGGGTCT GCGAACATCG CTGGCGACAA ATAAGGAACA TGGTTATCTT 120 0 

CCGTAATGTG GTAGACGGTC AGCCTTTCTC AAACTGGTGG GACAACGGGA GCAATCAAGT 126 0 

AGCTTTCGGT CGCGGCGACA GAGGCTTCAT TGTCTTTAAT AATGATGACT GGTATATGAA 1320 

TGTCGATTTG CAAACTGGTC TGCCTGCTGG AACCTACTGC GATGTTATTT CTGGACAAAA 13 8 0 

GGAAGGCAGT GCGTGTACTG GAAAGCAGGT GTACGTTTCC TCGGATGGAA AGGCCAATTT 14 4 0 

CCAGATTAGT AACAGCGATG AAGATCCATT TGTTGCAATT CACGTTGATG CCAAGTTATA 1500 
AGCTT 

1505 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

AAGCTTGTAA ACAGCTGGGG AG CATGGCGA ACAGCCAGGC AATTAATTGA GATTATTAAT 60 

TGTACGAAAT ATATATGATG AGATTATAAA CACACAACAC TTTTATTCGC AAGGGATGAT 12 0 

AAGAATCTAA TATATATATT ATCTGGGCTT CAAAGCATTG ATTTTATTTA TTGAGTCAAG 18 0 

AGGGAAATTT ATTTTCTTGT TATTCTCTGT CCAGGTCTAA AGTCCCGAGC GGTGAGGCTA 24 0 

TCTATTGATT TGGACATTCC AATCGAATAC AAAACAGAGA TACAGAAATT TGCGAAAAAA 3 00 

TTTGATAACA ATCGTGGATT TTACGAATTA GACAAATTGA TATGTGCTTG CTAATTGATG 36 0 

TGGCATGAAA TAAGAAATTT ATAAGGACGT TTTCAAGTGC TTCTATTTTA AACATTCAGG 42 0 

ATTTTTTTTT AAAG C AG AC A GCTTTCAACA GGTTTGATGA GAATTTGAAT ATTGATTGTT 48 0 

GACTTTAGCT ATACATAAAT CACACCTCAT CCACCCATTG TGGTATCCTT CGAAGGACTT 54 0 

GGGAACTGGA TCCTCTAGAA GGCGCGCC 56 8 
(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
ATGCTAGCTC AGTACAATCC CAACACTCAG GCT 3 3 

(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 4: 
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CGAAGCTTAT AACTTGGCAT CAACGTGAAT TG 
(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
CCATGGTTCT GGCCAAGAGC ATAGTGTGCC TCGCCCTCCT GGCGGTGGCG CTAGCT 
(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 
ATGCTAGCGC AATTCGACAC CAACTACGCA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAAGCTTGTT TACAACTTGG CGTTGACGT 
(2) INFORMATION FOR SEQ ID NO : 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 
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( C ) S TRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATGCTAGCGC AATTCGACAC CAACTACGCA TCCGGTCGTA GTGGAATGGT CCACCTCTTC 6 0 

GAGTGGAAGT GGGACGACAT CGCTGCCGAG TGCGAAAACT TCCTTGGACC CAATGGCTAC 12 0 

GCCGGTGTTC AGGTCTCCCC TGTGAACGAG AACGCCGTCA AGGACAGCCG CCCCTGGTGG 18 0 

GAACGTTACC AGCCCATCTC CTACAAGCTG GAGACCCGCT CCGGAAACGA AGAGCAGTTC 24 0 

GCCAGCATGG TCAAGCGCTG CAACGCCGTC GG AG TGCGC A CCTACGTGGA CGTGGTCTTC 3 00 

AACCACATGG CCGCCGACGG AGGCACCTAC GGCACTGGCG GCAGCACCGC CAGCCCCAGC 36 0 

AGCAAGAGCT ATCCCGGAGT GCCCTACTCC TCGCTGGACT TCAACCCGAC CTGCGCCATC 42 0 

AGCAACTACA ACGACGCCAA CGAGG TGCGC AACTGCGAGC TGGTCGGTCT GCGCGACCTT 480 

AACCAGGGCA ACTCCTACGT GCAGGACAAG GTGGTCGAGT TCCTGGACCA TCTGATTGAT 54 0 

CTCGGCGTGG CCGGATTCCG CGTGGACGCC GCCAAGCACA TGTGGCCCGC CGACCTGGCC 600 

GTCATCTATG GCCGCCTCAA GAACCTAAAC ACCGACCACG GCTTCGCCTC GGGATCCAAG 66 0 

GCGTACATCG TCCAGGAGGT CATCGACATG GGCGGCGAGG CCATCAGCAA GTCCGAGTAC 72 0 

AC CGGACTGG GCGCCATCAC CGAGTTCCGC CACTCCGACT CCATCGGCAA GGTCTTCCGC 780 

GGCAAGGACC AG CTG CAGTA CCTGACCAAC TGGGGCACCG CCTGGGGCTT CGCTGCCTCC 84 0 

GACCGCTCCC TGGTATTCGT CGACAACCAC GACAATCAGC GCGGACATGG AGCAGGAGGC 900 

GCCGACGTTC TGACCTACAA GGTGCCCAAG CAGTACAAGA TGGCCTCCGC CTTCATGCTG 96 0 

GCGCACCCCT TCGGCACTCC CCGCGTGATG TCCTCCTTCT CCTTCACGGA CACCGATCAG 102 0 

GGCCCGCCCA CCACCGACGG CCACAACATC GCCTCGCCCA TCTTCAATAG CGACAACTCC 1080 

TGCAGCGGCG GCTGGGTGTG TGAGCACCGC TGGCGCCAGA TCTACAACAT GGTGGCCTTC 114 0 

CGAAACACCG TGGGCTCGGA CGAGATCCAG AACTGGTGGG ACAACGGCAG CAACCAGATC 12 00 

TCCTTCAGCC GAGGCAGCCG CGGCTTCGTG GCCTTCAACA ACGACAACTA CGACCTGAAC 1260 

AGCTCCCTGC AGACGGGCCT GCCCGCCGGC ACCTACTGCG ACGTCATCTC CGGCTCCAAG 132 0 

AGCGGTTCCT CCTGCACGGG CAAGACCGTC ACCGTCGGAT CCGACGGACG GGCTTCCATC 13 8 0 
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AACATTGGCA GCTCCGAGGA CGACGGAGTG CTGGC CATTC ACGTCAACGC CAAGTTGTAA 
ACAAGCTTG 

(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

GGTACCGAGC TCTTACGCGT GCTAGCCCGG GCTCGAGCGG CCGCCAGTGT GATGGATATC 6 0 

TGCAGAATTC GGCTTAGTCG ACCCATGGTT CTGGC CAAGA GCATAGTGTG CCTCGCCCTC 12 0 

CTGGCGGTGG CGCTAGCCCA ACGCCCAATT CGACACCAAC TACGCATCCG GTCGTAGTGG 18 0 

AATGGTCCAC CTCTTCGAGT GGAAG TGGG A CGACATCGCT GCCGAGTGCG AAAACTTCCT 24 0 

TGGACCCAAT GGCTACGCCG GTGTTCAGGT CTCCCCTGTG AACGAGAACG CCGTCAAGGA 3 00 

CAGCCGCCCC TGGTGGGAAC GTTACCAGCC CATCTCCTAC AAGCTGGAGA CCCGCTCCGG 360 

AAACGAAGAG CAGTTCGCCA GCATGGTCAA GCGCTGCAAC GCCGTCGGAG TGCGCACCTA 4 20 

CGTGGACGTG GTCTTCAACC ACATGGCCGC CGACGGAGGC ACCTACGGCA CTGGCGGCAG 4 80 

CACCGCCAGC CCCAGCAGCA AGAG CTATCC CGGAGTGCCC TACTCCTCGC TGGACTTCAA 54 0 

CCCGACCTGC GCCATCAGCA ACTACAACGA CGCCAACGAG GTGCGCAACT GCGAGCTGGT 600 

CGGTCTGCGC GACCTTAACC AGGGCAACTC CTACGTGCAG GACAAGGTGG TCGAGTTCCT 66 0 

GGACCATCTG ATTGATCTCG GCGTGGCCGG ATTCCGCGTG GACGCCGCCA AG CAC ATGTG 72 0 

GCCCGCCGAC CTGGCCGTCA TCTATGGCCG CCTCAAGAAC CTAAACACCG ACCACGGCTT 78 0 

CGCCTCGGGA TCCAAGGCGT ACATCGTCCA GGAGGTCATC GACATGGGCG GCGAGGCCAT 84 0 

CAGCAAGTCC GAGTACACCG GACTGGGCGC CATCACCGAG TTCCGCCACT CCGACTCCAT 90 0 

CGGCAAGGTC TTCCGCGGCA AGGACCAGCT GCAGTACCTG ACCAACTGGG GCACCGCCTG 96 0 

GGGCTTCGCT GCCTCCGACC GCTCCCTGGT ATTCGTCGAC AACCACGACA ATCAGCGCGG 102 0 

ACATGGAGCA GGAGGCGCCG ACGTTCTGAC CTACAAGGTG CCCAAGCAGT ACAAGATGGC 108 0 

CTCCGCCTTC ATGCTGGCGC ACCCCTTCGG CACTCCCCGC GTGATGTCCT CCTTCTCCTT 1140 
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CACGGACACC GATCAGGGCC CGCCCACCAC 
CAATAGCGAC AACTCCTGCA GCGGCGGCTG 
CAACATGGTG GCCTTCCGAA ACACCGTGGG 
CGGCAGCAAC CAGATCTCCT TCAGCCGAGG 
CAACTACGAC CTGAACAGCT CCCTGCAGAC 
CATCTCCGGC TCCAAGAGCG GTTCCTCCTG 
CGGACGGGCT TCCATCAACA TTGGCAGCTC 
CAACGCCAAG TTGTAAGCTT GTAAACAGCT 
TTGAGATTAT TAATTGTACG AAATATATAT 
TCGCAAGGGA TGATAAGAAT CTAATATATA 
TTTATTGAGT CAAGAGGGAA ATTTATTTTC 
GAGCGGTGAG GCTATCTATT GATTTGGACA 
AATTTCGGAA AAAATTTGAT AACAATCGTG 
CTTGCTAATT GATGTGG CAT GAAATAAGAA 
TTTAAACATT CAGGATTTTT TTTTAAAGCA 
GAATATTGAT TGTTGACTTT AGCTATACAT 
CCTTCGAAGG ACTTGGGAAC TGGATCCTCT 
TTGGTTATAT AGCATAAATC AATATTGGCT 
ATAATATGTA CATTTATATT GGCTCATGTC 
TGACTAGTTA TTAATAGTAA TCAATTACGG 
TCCGCGTTAC ATAACTTACG GTAAATGGCC 
CATTGACGTC AATAATGACG TATGTTCCCA 
GTCAATGGGT GGAGTATTTA CGGTAAACTG 
TGCCAAGTAC GCCCCCTATT GACGTCAATG 
AGTACATGAC CTTATGGGAC TTTCCTACTT 
TTACCATGGT GATGCGGTTT TGGCAGTACA 
GGGGATTTCC AAGTCTCCAC CCCATTGACG 
AACGGGACTT TCCAAAATGT CGTAACAACT 
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CGACGGCCAC AACATCGCCT CGCCCATCTT 12 0 0 

GGTGTGTGAG CACCGCTGGC GCCAGATCTA 126 0 

CTCGGACGAG ATCCAGAACT GGTGGGACAA 132 0 

CAGCCGCGGC TTCGTGGCCT TCAACAACGA 13 8 0 

GGGCCTGCCC GCCGGCACCT ACTGCGACGT 144 0 

CACGGGCAAG ACCGTCACCG TCGGATCCGA 15 00 

CGAGGACGAC GGAGTGCTGG CCATTCACGT 156 0 

GGGGAGCATG GCGAACAGCC AGGCAATTAA 162 0 

GATGAGATTA TAAACACACA ACACTTTTAT 16 8 0 

TATTATCTGG GCTTCAAAGC ATTGATTTTA 174 0 

TTGTTATTCT CTGTCCAGGT CTAAAGTCCC 180 0 

TTCCAATCGA ATACAAAACA GAGATACAGA 186 0 

GATTTTAGCA ATTAGACAAA TTGATATGTG 1920 

ATTTATAAGG ACGTTTTCAA GTGCTTCTAT 19 80 

GACAGCTTTC AACAGGTTTG ATCAGAATTT 2 04C; 

AAATC AC AC C TCATCCACCC ATTGTGGTAT 2100 

AGAGCTTCCG GCCATTAGCC ATATTATTCA 21 6 C 

ATTGGCCATT GCATACGTTG TATCCATATC 222 C 

CAACATTACC GCCATGTTGA CATTGATTAT 2 28 0 

GGTCATTAGT TCATAGCCCA TATATGGAGT 2 34C 

CGCCTGGCTG ACCGCCCAAC GACCCCCGCC 2 4 00 

TAGTAACGCC AATAGGGACT TTCCATTGAC 24 6 0 

CCCACTTGGC AGTACATCAA GTGTATCATA 2 52 0 

ACGGTAAATG GCCCGCCTGG CATTATGCCC 2 58 0 

GGCAGTACAT CTACGTATTA GTCATCGCTA 2 64 0 

TCAATGGGCG TGGATAGCGG TTTGACTCAC 2 700 

TCAATGGGAG TTTGTTTTGG CACCAAAATC 2760 

CCGCCCCATT GACGCAAATG GGCGGTAGGC 2 82 0 
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ATGTACGGTG GGAGGTCTAT ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA 2880 
GACGCCATCC ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG 294 0 

GCCCCAAGCT CATGGTTCTG GCCAAGAGCA TAGTGTGCCT CGCCCTCCTG GCGGTGGCGC 3000 
TAGCCCAGTA CAATCCCAAC ACTCAGGCTG GGAGGACATC TATCGTGCAT CTCTTTGAAT 3060 
GGCGCTGGGC CGACATTGCA CTGGAGTGCG AACACTATTT AGCTCCTAAT GGGTTTGGAG 3120 
GAGTTCAGGT TTCTCCTCCA AATGAAAACA TTGTCATTAC TAATCCGAAC AGGCCCTGGT 3180 
GGGAAAGATA CCAGCCCATC AGCTACAAGA TCTGCAGTCG ATCGGGCAAT GAAAATGAAT 324 0 

TCAGAGACAT GGTGACCAGA TGCAACAATG TTGGAGTTCG TATTTATGTG GATGCTGTTG 33 00 

TCAATCACAT GTGTGGATCT ATGGGTGGCA CGGGCACCCA CTCAACATGT GGGAGCTATT 336 0 

TCAACACCGG GACTAGAGAT TTTC CCGCTG TGCCGTACTC TGCCTGGGAT TTCAATGACG 34 2 0 

GCAAATGT C A CACTGCAAGT GGAGACATCG AAAATTATGG GGACATGTAT CAGGTCCGGG 34 8 0 

ATTGCAAGTT GTCCAGCCTT CTTGATCTGG CTCTGGAGAA GGACTATGTA CGCTCAACAA 354 0 

TTGCAGCGTA CATGAATCAC CTCATTGATA TGGGTGTAGC AGGGTT CCGG ATCGATGCTG 36 00 

CCAAGCATAT GTGGCCAGGG GACATAAGAG CGTTTCTGGA CAAACTGCAC GATCTAAATA 366 0 

CTCAGTGGTT TTCAGCAGGA ACGAAACCCT TTATTTACCA AGAGGTAATT GACTTGGGAG 3 72 0 

GAGAGCCAAT CACAGGCAGT CAGTACTTTG GGAATGGCCG CGTGACAGAA TTCAAGTATG 378 0 

GTGCCAAACT GGGGACGGTG ATCCGGAAGT GGAATGGAGA GAAGATGGCC TACTTAAAGA 3 84 0 

ACTGGGGAGA AGGCTGGGGC TTTGTGCCTT CTGACAGAGC CCTGGTGTTT GTGGATAACC 3 900 

ACGACAACCA GCGGGGGCAC GGGGCAGGCG GAGCTTCCAT TCTTACTTTC TGGGATGCCA 3 960 

GGCTTTATAA AATGGCGGTT GGTTTCATGC TCGCTCATCC GTACGGGTTC ACACGGGTGA 4 020 

TGTCAAGTTA TCGTTGGCCA AGATATTTCG AAAACGGAGT GGATGTTAAC GACTGGGTGG 4 080 

GACCACCAAG TAACTCGGAC GGATCGACGA AGTC CGTTAC AATCAACGCA GACACTACCT 414 0 

GTGGCAATGA CTGGGTCTGC GAACATCGCT GGCGACAAAT AAGGAACATG GTTATCTTCC 4200 

GTAATGTGGT AGACGGTCAG CCTTTCTCAA ACTGGTGGGA CAACGGGAGC AATCAAGTAG 4260 

CTTTCGGTCG CGGCGACAGA GGCTTCATTG TCTTTAATAA TGATGACTGG TATATGAATG 4320 

TCGATTTGCA AACTGGTCTG CCTGCTGGAA CCTACTGCGA TGTTATTTCT GGACAAAAGG 438 0 

AAGGCAGTGC GTGTACTGGA AAGCAGGTGT ACGTTTCCTC GGATGGAAAG GCCAATTTCC 444 0 

AGATTAGTAA CAGCGATGAA GATCCATTTG TTGCAATTCA CGTTGATGCC AAGTTATAAG 4500 
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GCCGGCCGCT TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA 4 56 0 

GAATGCAGTG AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA 4 62 0 

CCATTATAAG CTG C AAT AAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG 4 68 0 

TTCAGGGGGA GGTGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA 4 74 0 

TCGATAAGGA TCCGTCGACC GATGCCCTTG AG AG C CTTCA ACCCAGTCAG CTCCTTCCGG 480 0 

TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT CATGCAACTC 4 86 0 

GTAGGACAGG TGCCGGCAGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT 4 92 0 

CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA 4 98 0 

ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 5 04 0 

TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AG CATC AC AA 5100 

AAATCGACGC TCAAGTCAGA GGTGG CGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT 516 0 

TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 522 0 

GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT 52 8 0 

CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 53 4.0 

CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT 54 00 

ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC 54 6 0 

TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT 5 52 0 

CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA 5 58 0 

ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 56 4 0 

AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA 57 00 

AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT 576 0 

TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA 582 0 

CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC 5 880 

CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TAC CAT CTGG 5 94 0 

CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT 6 00 0 

AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT 6060 

CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG 6120 

CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC 6180 
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ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 624 0 
AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC 6300 
ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT 6360 
TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGACCGAG 6420 
TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT 6480 
GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 654 0 
ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC 6600 
CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGG C 6660 
GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA GCATTTATCA 672 0 
GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG 6780 

GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGCG CCCTGTAGCG GCGCATTAAG 684 0 

CGCGGCGGGT GTGGTGGTTA CGCG CAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC 6 900 

CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC 6 960 

TCTAAATCGG GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCAC C TCGACCCCAA 7 02 0 

AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG 7 08 0 

CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGG ACT C TTGTTCCAAA CTGGAACAAC 714 0 

ACTCAACCCT ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA 7200 

TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC 726 0 

GTTTACAATT TCCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC GATCGGTGCG 7 3 20 

GGC CTCTTCG CTATTACGCC AGCCCAAGCT AC CATG AT AA GTAAGTAATA TTAAGGTACG 73 80 

GGAGGTACTT GGAGCGGCCG CAATAAAATA TCTTTATTTT CATTACATCT GTGTGTTGGT 74 4 0 

TTTTTGTGTG AATCGATAGT ACTAACATAC GCTCTCCATC AAAACAAAAC GAAACAAAAC 7 5 00 

AAACTAGCAA AATAGGCTGT CCCCAGTGCA AGTGCAGGTG CCAGAACATT TCTCTATCGA 7 560 
TA 

7562 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGTACCGAAA CGCGCGAGGC AGCTGATCTG AGTCACTGAA TGGGATAGTG TCCACAAGGG 60 

CGGGGGCTAT TCTTGTCCAT CTGGGCCTAC GGAACCAGCA CCCATCGCCA AACTCTTCAT 12 0 

CCTCTTCCTC AATCTCGCTT TCTCTCTCGT CCGCTTTTTT TTTCTTCTTC TTTTTTTTTT 18 0 

TTTTTTTTTT TTTTTTTTGC AAAAGGAGGG GAGAGGGGGT AAAAAAATGC TGCACTGTGC 24 0 

GGCGAGGCCG GTGAGTGAGC GACGCGGAGC CAATCAGCGC CCGCCGTTCC GAAAGTTGCC 3 00 

TTTTATGGCT CGAGTGGCCG CTGTGGCGTC CTATAAAACC CGGCGGCGCA ACGCGCAGCC 36 0 

ACTGTCGAGT CAAGCTTCGA TCGTCGAGCG GCCGCCAGTG TGATGGATAT CTGCAGAATT 42 0 

CGGCTTAGTC GACCCATGGT TCTGGCCAAG AGCATAGTGT GCCTCGCCCT CCTGGCGGTG 48 0 

GCGCTAGCCC AACGCCCAAT T CG AC AC C AA CTACGCATCC GGTCGTAGTG GAATGGTCCA 54 0 

CCTCTTCGAG TGGAAGTGGG ACGACATCGC TGCCGAGTGC GAAAACTTCC TTGGACCCAA 6 00 

TGGCTACGCC GGTGTTCAGG TCTCCCCTGT GAACGAGAAC GCCGTCAAGG ACAGCCGCCC 66 0' 

CTGGTGGGAA CGTTACCAGC CCATCTCCTA CAAGCTGGAG ACCCGCTCCG GAAACGAAGA 72 0 

GCAGTTCGCC AGCATGGTCA AGCGCTGCAA CGCCGTCGGA GTGCGCACCT ACGTGGACGT 78 0 

GGTCTTCAAC CACATGGCCG CCGACGGAGG CACCTACGGC ACTGGCGGCA GCACCGCCAG 84 0 

CCCCAGCAGC AAGAGCTATC CCGGAGTGCC CTACTCCTCG CTGGACTTCA ACCCGACCTG 9 00 

CGCCATCAGC AACTACAACG ACGCCAACGA GGTGCGCAAC TGCGAGCTGG TCGGTCTGCG 96 0 

CGACCTTAAC CAGGGCAACT CCTACGTGCA GGACAAGGTG GTCGAGTTCC TGGACCATCT 102 0 

GATTGATCTC GGCGTGGCCG GATTCCGCGT GGACGCCGCC AAGCACATGT GGCCCGCCGA 108 0 

CCTGGCCGTC ATCTATGGCC GCCTCAAGAA CCTAAACACC GACCACGGCT TCGCCTCGGG 114 0 

ATCCAAGGCG TACATCGTCC AGGAGGTCAT CGACATGGGC GGCGAGGCCA TCAGCAAGTC 12 00 

CGAGTACACC GGACTGGGCG CCATCACCGA GTTCCGCCAC TCCGACTCCA TCGGCAAGGT 1260 

CTTCCGCGGC AAGGACCAGC TGCAGTACCT GACCAACTGG GGCACCGCCT GGGGCTTCGC 1320 

TGCCTCCGAC CGCTCCCTGG TATTCGTCGA CAACCACGAC AATCAGCG CG GACATGGAGC 13 8 0 

AGGAGGCGCC GACGTTCTGA CCTACAAGGT GCCCAAGCAG TACAAGATGG CCTCCGCCTT 1440 

CATGCTGGCG CACCCCTTCG GCACTCCCCG CGTGATGTCC TCCTTCTCCT TCACGGACAC 15 00 
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CGATCAGGGC CCGCCCACCA CCGACGGCCA CAACATCGCC TCGCCCATCT TCAATAGCGA 1560 
CAACTCCTGC AGCGGCGGCT GGGTGTGTGA GCACCGCTGG CGCCAGATCT ACAACATGGT 16 2 0 

GGCCTTCCGA AACACCGTGG GCTCGGACGA GATCCAGAAC TGGTGGGACA ACGGCAGCAA 16 80 

CCAGATCTCC TTCAG CCGAG GCAGCCGCGG CTTCGTGGCC TTCAACAACG ACAACTACGA 1740 

CCTGAACAGC TCCCTGCAGA CGGGCCTGCC CGCCGGCACC TACTGCGACG TCATCTCCGG 18 00 

CTCCAAGAGC GGTTCCTCCT GCACGGGCAA GACCGTCACC GTCGGATCCG ACGGACGGGC 1860 

TTCCATCAAC ATTGG CAGCT CCGAGGACGA CGGAGTGCTG GCCATTCACG TCAACGCCAA 192 0 

GTTGTAAGCT TGTAAACAGC TGGGGAGCAT GGCGAACAGC CAGGCAATTA ATTGAGATTA 1980 

TTAATTGTAC GAAATATATA TGATGAGATT ATAAACACAC AACACTTTTA TTCGCAAGGG 2 04 0 

ATGATAAGAA TCTAATATAT ATATTATCTG GGCTTCAAAG CATTGATTTT ATTTATTGAG 2100 

TCAAGAGGGA AATTTATTTT CTTGTTATTC TCTGTCCAGG TCTAAAGTCC CGAGCGGTGA 216 0 

GGCTATCTAT TGATTTGGAC ATTCCAATCG AATACAAAAC AGAGATACAG AAATTTCGGA 222 0 

AAAAATTTGA TAACAATCGT GGATTTTAGC AATTAGACAA ATTGATATGT GCTTGCTAAT 22 8 0 

TGATGTGGCA TGAAATAAGA AATTTATAAG GACGTTTTCA AGTG CTTCTA TTTTAAACAT 2 34 0 

TCAGGATTTT TTTTTAAAGC AGACAGCTTT CAACAGGTTT GATCAGAATT TGAATATTGA 24 0 0 

TTGTTGACTT TAGCTATACA TAAATCACAC CTCATCCACC CATTGTGGTA TCCTTCGAAG 2 46 0 

GACTTGGGAA CTGGATCCTC TAGAGCTTCC GGCCATTAGC CATATTATTC ATTGGTTATA 2 52 0 

TAGCATAAAT CAATATTGGC TATTGGCCAT TGCATACGTT GTATCCATAT CATAATATGT 25 8 0 

ACATTTATAT TGGCTCATGT CCAACATTAC CGCCATGTTG ACATTGATTA TTGACTAGTT 2 64 0 

ATTAATAGTA ATCAATTACG GGGTCATTAG TTCATAGCCC ATATATGGAG TTCCGCGTTA 27 00 

CATAACTTAC GGTAAATGGC CCGCCTGGCT GACCGCCCAA CGACCCCCGC CCATTGACGT 2 76 0 

CAATAATGAC GTATGTTCCC AT AGTAAC G C CAATAGGGAC TTTCCATTGA CGTCAATGGG 2 82 0 

TGGAGTATTT ACGGTAAACT GCCCACTTGG CAGTACATCA AGTGTATCAT ATGCCAAGTA 2 88 0 

CGCCCCCTAT TGACGTCAAT GACGGTAAAT GGCCCGCCTG GCATTATGCC CAGTACATGA 2 94 0 

CCTTATGGGA CTTTCCTACT TGGCAGTACA TCTACGTATT AGTCATCGCT ATTACCATGG 3 000 

TGATGCGGTT TTGGCAGTAC ATCAATGGGC GTGGATAGCG GTTTGACTCA CGGGGATTTC 3 06 0 

CAAGTCTCCA CCCCATTGAC GTCAATGGGA GTTTGTTTTG GCAC CAAAAT CAACGGGACT 312 0 

TTCCAAAATG TCGTAACAAC TCCGCCCCAT TGACGCAAAT GGGCGGTAGG CATGTACGGT 318 0 
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GGGAGGTCTA TATAAGCAGA GCTCGTTTAG TGAACCGTCA GATCGCCTGG AGACGCCATC 324 0 

CACGCTGTTT TGACCTCCAT AGAAGACACC GGGACCGATC CAGCCTCCGC GGCCCCAAGC 33 00 

TCATGGTTCT GGCCAAGAGC ATAGTGTGCC TCGCCCTCCT GGCGGTGGCG CTAGCCCAGT 3 360 

ACAATCCCAA CACTCAGGCT GGGAGGACAT CTATCGTGCA TCTCTTTGAA TGGCGCTGGG 3420 

CCGACATTGC ACTGGAGTGC GAACACTATT TAGCTC CTAA TGGGTTTGGA GGAGTTCAGG 34 80 

TTTCTCCTCC AAATGAAAAC ATTGTCATTA CTAATCCGAA CAGGCCCTGG TGGGAAAGAT 3 54 0 

ACCAGCCCAT C AG CTACAAG ATCTGCAGTC GATCGGGCAA TGAAAATGAA TTCAGAGACA 3 6 00 

TGGTGACCAG ATGCAACAAT GTTGGAGTTC GTATTTATGT GGATGCTGTT GTCAATCACA 3 6 60 

TGTGTGGATC TATGGGTGGC ACGGGCACCC ACTCAACATG TGGGAGCTAT TTCAACACCG 3 7 20 

GGACTAGAGA TTTTCCCGCT GTGCCGTACT CTGCCTGGGA TTTCAATGAC GGCAAATGTC 3 7 80 

ACACTGCAAG TGGAGACATC GAAAATTATG GGGACATGTA TCAGGTCCGG GATTGCAAGT 3 84 0 

TGTCCAGCCT TCTTGATCTG GCTCTGGAGA AGGACTATGT ACGCTCAACA ATTGCAGCGT 3 9 00 

ACATGAATCA CCTCATTGAT ATGGGTGTAG CAGGGTTCCG GATCGATGCT GCCAAGCATA 3 960 

TGTGGC CAGG GGACATAAGA GCGTTTCTGG ACAAACTGCA CG AT CTAAAT ACTCAGTGGT 4.02 0 

TTT CAGCAGG AACGAAACCC TTTATTTACC AAGAGGTAAT TGACTTGGGA GGAGAGCCAA 4 0 80 

TCACAGGCAG TCAGTACTTT GGGAATGGCC GCGTGACAGA ATTCAAGTAT GGTGCCAAAC 4.14 0 

TGGGGACGGT GATC CGGAAG TGGAATGGAG AGAAGATGGC CTACTTAAAG AACTGGGGAG 4 2 00 

AAGGCTGGGG CTTTGTGCCT TCTGACAGAG CCCTGGTGTT TGTGGATAAC CACGACAACC 4 260 

AGCGGGGGCA CGGGGCAGGC GGAGCTTCCA TTCTTACTTT CTGGGATGCC AGGCTTTATA 4 320 

AAATGGCGGT TGGTTTCATG CTCGCTCATC CGTACGGGTT CACACGGGTG ATGTCAAGTT 4 3 80 

ATCGTTGGCC AAGATATTTC GAAAACGGAG TGGATGTTAA CGACTGGGTG GGACCACCAA 4 44 0 

GTAACTCGGA CGGATCGACG AAGTCCGTTA CAATCAACGC AGACACTACC TGTGGC AATG 4 5 00 

ACTGGGTCTG CGAACAT CGC TGGCGACAAA TAAGGAACAT GGTTATCTTC CGTAATGTGG 4 560 

TAGACGGTCA GCCTTTCTCA AACTGGTGGG ACAACGGGAG CAATCAAGTA GCTTTCGGTC 4620 

GCGGCGACAG AGGCTTCATT GTCTTTAATA ATGATGACTG GTATATGAAT GTCGATTTGC 4 6 80 

AAACTGGTCT GCCTGCTGGA ACCTACTGCG ATGTTATTTC TGGACAAAAG GAAGGCAGTG 4 74 0 

CGTGTACTGG AAAGCAGGTG TACGTTTCCT CGGATGGAAA GGCCAATTTC CAGATTAGTA 4 8 00 

ACAGCGATGA AGATCCATTT GTTGCAATTC ACGTTGATGC CAAGTTATAA GGCCGGCCGC 4 860 
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TTCGAGCAGA 


CATGATAAGA 


TACATTGATG 


AGTTTGGACA 


AACCACAACT 


AGAATGCAGT 


4920 


GAAAAAAATG 


CTTTATTTGT 


GAAATTTGTG 


ATGCTATTGC 


TTTATTTGTA 


ACCATTATAA 


4980 


GCTGCAATAA 


ACAAGTTAAC 


AACAACAATT 


GCATTCATTT 


TATGTTT C AG 


GTTCAGGGGG 


5040 


AGGTGTGGGA 


GGTTTTTTAA 


AGCAAGTAAA 


ACCTCTACAA 


ATGTGGTAAA 


ATCGATAAGG 


5100 


ATC CGTCGAC 


CGATGCCCTT 


GAGAGCCTTC 


AACCCAGTCA 


GCTCCTTCCG 


GTGGGCGCGG 


5160 


GGCATGACTA 


TCGTCGCCGC 


ACTTATGACT 


GTCTTCTTTA 


TCATGCAACT 


CGTAGGACAG 


5220 


GTGCCGGCAG 


CGCTCTTCCG 


CTTCCTCGCT 


CACTGACTCG 


CTGCGCTCGG 


TCGTTCGGCT 


5280 


GCGGCGAGCG 


GTATCAGCTC 


ACTCAAAGGC 


GGTAATACGG 


TTATCCACAG 


AATCAGGGGA 


5340 


TAACGCAGGA 


AAGAACATGT 


GAGCAAAAGG 


CCAGCAAAAG 


GCCAGGAACC 


GTAAAAAGGC 


5400 


CGCGTTGCTG 


GCGTTTTTCC 


ATAGGCTCCG 


CCCCCCTGAC 


GAGCATCACA 


AAAATCGACG 


5460 


CTCAAGTCAG 


AGGTGGCGAA 


ACCCGACAGG 


ACTATAAAGA 


TACCAGGCGT 


TTCCCCCTGG 


5520 


AAGCTCCCTC 


GTGCGCTCTC 


CTGTTCCGAC 


CCTGCCGCTT 


AC CGGATAC C 


TGTCCGCCTT 


5580 


TCTCCCTTCG 


GGAAGCGTGG 


CGCTTTCTCA 


TAGCTCACGC 


TGTAGGTATC 


TCAGTTCGGT 


5640 


GTAGGTCGTT 


CGCTCCAAGC 


TGGGCTGTGT 


GCACGAACCC 


CCCGTTCAGC 


CCGACCGCTG 


5700 


CGCCTTATCC 


GGTAACTATC 


GTCTTGAGTC 


CAAC CCGGT A 


AGACACGACT 


TATCGCCACT 


5760 


GGCAGCAGCC 


ACTGGTAACA 


GGATTAGCAG 


AGCGAGGTAT 


GTAGGCGGTG 


CTACAGAGTT 


5820 


CTTGAAGTGG 


TGGCCTAACT 


ACGGCTACAC 


TAGAAGGACA 


GTATTTGGTA 


TCTGCGCTCT 


5880 


GCTGAAGCCA 


GTTACCTTCG 


GAAAAAGAGT 


TGGTAGCTCT 


TGATCCGGCA 


AACAAACCAC 


5940 


CGCTGGTAGC 


GGTGGTTTTT 


TTGTTTGCAA 


GCAGCAGATT 


ACGCGCAGAA 


AAAAAGGATC 


6000 


TCAAGAAGAT 


CCTTTGATCT 


TTTCTACGGG 


GTCTGACGCT 


CAGTGGAACG 


AAAACTCACG 


6060 


TTAAGGGATT 


TTGGTCATGA 


GATTATCAAA 


AAGGATCTTC 


ACCTAGATCC 


TTTTAAATTA 


6120 


AAAATGAAGT 


TTTAAATCAA 


TCTAAAGTAT 


ATATGAGTAA 


ACTTGGTCTG 


ACAGTTACCA 


6180 


ATGCTTAATC 


AGTGAGGCAC 


CTATCTCAGC 


GATCTGTCTA 


TTTCGTTCAT 


CCATAGTTGC 


6240 


CTGACTCCCC 


GTCGTGTAGA 


TAACTACGAT 


ACGGGAGGGC 


TTACCATCTG 


GCCCCAGTGC 


6300 


TGCAATGATA 


CCGCGAGACC 


CACGCTCACC 


GGCTC CAGAT 


TTATCAGCAA 


TAAACCAGCC 


6360 


AGCCGGAAGG 


GCCGAGCGCA 


GAAGTGGTCC 


TGCAACTTTA 


TCCGCCTCCA 


TCCAGTCTAT 


6420 


TAATTGTTGC 


CGGGAAGCTA 


GAGTAAGTAG 


TTCGCCAGTT 


AATAGTTTGC 


GCAACGTTGT 


6480 


TGCCATTGCT 


ACAGGCATCG 


TGGTGTCACG 


CTCGTCGTTT 


GGTATGGCTT 


CATTCAGCTC 


6540 
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CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG 66 00 

CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT 666 0 

TATGG CAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC 6 72 0 

TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG 678 0 

CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT 684 0 

TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC 6 9 00 

GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC 6 960 

TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA 7 02 0 

ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG 7080 

TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG 714 0 

CACATTTCCC CGAAAAGTGC CACCTGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG 7 2 00 

TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT 72 60 

CGCTTTCTTC CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG 73 2 0 

GGGGCTCCCT TTAGGGTTCC GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA 73 8 0' 

TTAGGGTGAT GGTTCACGTA GTGGGCCATC GCCCTGATAG ACGGTTTTTC GCCCTTTGAC 74 4 0 

GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA ACTGGAACAA CACTCAACCC 7 5 00 

TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT ATTGGTTAAA 7 56 0 

AAATGAGCTG ATTTAACAAA AATTTAACGC GAATTTTAAC AAAATATTAA CGTTTACAAT 76 20 

TTCCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC 76 8 0 

GCTATTACGC CAGCCCAAGC TACCATGATA AGTAAGTAAT ATTAAGGTAC GGGAGGTACT 7 74 0 

TGGAGCGGCC GCAATAAAAT ATCTTTATTT TCATTACATC TGTGTGTTGG TTTTTTGTGT 7 8 00 

GAATCGATAG TACTAACATA CGCTCTCCAT CAAAACAAAA CGAAACAAAA CAAACTAGCA 7 86 0 

AAATAGGCTG TCCCCAGTGC AAGTGCAGGT G C C AG AACAT TTCTCTATCG ATA 7 913 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OF PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 



1 . An extra-cellular reporter gene system for use in a target cell, comprising a DNA 
sequence encoding: 

a signal peptide; 

at least one a-amylase protein; and 
a transcription termination region. 

2. The reporter gene system of claim 1, wherein the a-amylase protein is 
electrophoretically distinct from any endogenous amylase in the target cell and wherein 
the a-amylase protein is compatible with the target cells expression and secretion 
systems. 

3. The reporter gene system of claim 2, wherein the reporter gene system 
comprises DNA sequences encoding two a-amylase proteins. 

4. The reporter gene system of claim 3, wherein the a-amylase proteins are 
electrophoretically distinct from each other. 

5. The reporter gene system of claim 2, wherein the a-amylase protein is selected 
from the group consisting of chicken a-amylase and Drosophila melanogaster a- 
amylase. 

6. The reporter gene system of Claim 5, wherein the signal peptide is derived 
from a D. melanogaster a-amylase gene. 

7. The reporter gene system of Claim 6, wherein the signal peptide is encoded by 
the DNA sequence of SEQID No: 5. 
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8. The reporter gene system of Claim 7, wherein the chicken a-amylase protein 
is encoded by the DNA sequence of SEQID No: 1. 

9. The reporter gene system of Claim 7, wherein the D. melanogaster a-amylase 
protein is encoded by the DNA sequence of SEQID No: 8. 

10. The reporter gene system of Claim 7, wherein the transcription termination 
region is derived from the group consisting of a D. melanogaster a-amylase gene and 
a late polyadenylation site of SV40. 

11. The reporter gene system of Claim 10, wherein the transcription termination 
region is derived from the D. melanogaster a-amylase gene and is encoded by the 
DNA sequence of SEQID No: 2. 

12. The reporter gene system of Claim 1, wherein said system further comprises 
a promoter region. 

13. The reporter gene system of Claim 12, wherein the promoter region is selected 
from the group consisting of: a Cytomegalovirus promoter, a murine retroviral 
promoter and a rat p-actin promoter. 

14. The reporter gene system of claim 12, wherein the a-amylase protein is 
electrophoretically distinct from any endogenous amylase in the target cell and wherein 
the a-amylase protein is compatible with the target cells expression and secretion 
systems. 

15. The reporter gene system of claim 14, wherein the reporter gene system 
comprises DNA sequences encoding two a-amylase proteins. 

16. The reporter gene system of claim 15, wherein the a-amylase proteins are 
electrophoretically distinct from each other. 
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17. The reporter gene system of claim 14, wherein the a-amylase protein is selected 
from the group consisting of chicken a-amylase and Drosophila melanogaster a- 
amylase. 

18. The reporter gene system of Claim 17, wherein the signal peptide is derived 
from a D. melanogaster a-amylase gene. 

19. The reporter gene system of Claim 18, wherein the signal peptide is encoded 
by the DNA sequence of SEQID No: 5. 

20. The reporter gene system of Claim 17, wherein the chicken a-amylase protein 
is encoded by the DNA sequence of SEQID No: 1. 

21 . The reporter gene system of Claim 17, wherein the D. melanogaster a-amylase 
protein is encoded by the DNA sequence of SEQID No: 8. 

22. The reporter gene system of Claim 17, wherein the transcription termination 
region is derived from the group consisting of a D. melanogaster a-amylase gene and 
a late polyadenylation site of SV40. 

23. The reporter gene system of Claim 22, wherein the transcription termination 
region is derived from the D. melanogaster a-amylase gene and is encoded by the 
DNA sequence of SEQID No: 2. 

24. The reporter gene system of Claim 11, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 
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25. The reporter gene system of Claim 23, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a Cytomegalovirus promoter, 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

26. The reporter gene system of Claim 11, wherein the reporter gene system 
comprising a DNA sequence encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene: 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

27. The reporter gene system of Claim 23, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a Cytomegalovirus promoter, 

a signal peptide derived from a Drosophila melanogaster a-amylase gene: 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

28. The reporter gene system of Claim 16, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

two expression cassettes, wherein the first cassette comprises a DNA sequence 
encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene; 
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and wherein said second cassette comprises a DNA sequence encoding: 
a Cytomegalovirus promoter, 

a signal peptide derived from a D. melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a late polyadenylation site of 
SV40; 

wherein the first cassette ftirther comprises a polylinker region immediately 3' of the 
D. melanogaster a-amylase gene to facilitate insertion of a promoter sequence of 
interest to determine the strength of said promoter. 

29. The reporter gene system of any of the preceding claims were the target cell is 
a mammalian cell. 

30. A kit comprising the reporter gene system of any of the preceding claims, a 
means of detecting the amylase activity and instructions for use. 

31. The kit of claim 30 ftirther comprising a beta-dextrin system to remove non- 
specific proteins from high-protein fluids. 



32. The kit of claim 30 or 31, wherein the means of detecting the amylase system 
is selected from the group consisting of a plate assay and a gel electrophoresis means. 
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5'-G CT AGCTC AG TACAATCCCA ACACTCAGGC TGGG AGGACA TCTATCGTGC ATCTCTTTG A 
ATGGCGCTGG GCCGACATTG CACTGGAGTG CGAACACTAT TTAGCTCCTA ATGGGTTTGG 
AGGAGTTCAG GTTTCTCCTC CAAATGAAAA CATTGTCATT ACTAATCCGA AC AG GCCCTG 
GTGGG AAAGA TACCAGCCCA TCAGCTACAA G ATCTGCAGT CG ATCGGGCA ATG AAAATG A 
ATTCAGAGAC ATGGTGACCA GATGCAACAA TGTTGGAGTT CGTATTTATG TGGATGCTGT 
TGTCAATCAC ATGTGTGGAT CTATGGGTGG CACGGGCACC CACTCAACAT GTGGGAGCTA 
TTTCAACACC GGGACTAGAG ATTTTCCCGC TGTGCCGTAC TCTGCCTGGG ATTTC A ATG A 
CGGCAAATGT CACACTGCAA GTGGAGACAT CGAAAATTAT GGGG ACATGT ATCAGGTCCG 
GGATTGCAAG TTGTCCAGCC TTCTTGATCT GGCTCTGGAG AAGGACTATG TACGCTCAAC 
AATTGCAGCG TACATGAATC ACCTCATTGA TATGGGTGTA GCAGGGTTCC GG ATCGATGC 
TGCCAAGCAT ATGTGGCCAG GGGACATAAG AGCGTTTCTG GACAAACTGC ACG A TCTA A A 
TACTCAGTGG TTTTCAGCAG GAACGAAACC CTTTATTTAC CAAGAGGTAA TTGACTTGGG? 
AGGAGAGCCA ATCACAGGCA GTCAGTACTT TGGG AATGGC CGCGTGACAG AATTCAAGTA 
TGGTGCCAAA CTGGGGACGG TGATCCGGAA GTGG AATGGA GAG AAG ATGG CCTACTTA A A 
GAACTGGGGA GAAGGCTGGG GCTTTGTGCC TTCTGACAGA GCCCTG GTGT TTGTGGATAA 
CCACGACAAC CAGCGGGGGC ACGGGGCAGG CGGAGCTTCC ATTCTTACTT TCTGGGATGC 
CAGGCTTTAT AAAATGGCGG TTGGTTTCAT GCTCGCTCAT CCGTACGGGT TCACACGGGT 
GATGTCAAGT TATCGTTGGC CAAGATATTT CGAAAACGGA GTGGATGTTA ACGACTGGGT 
GGGACCACCA AGTAACTCGG ACGGATCGAC GAAGTCCGTT ACAATCAACG C AG AC ACTA C 
CTGTGGCAAT GACTGGGTCT GCGAACATCG CTGGCGACAA ATAAGGAACA TGGTTATCTT 
CCGTAATGTG GTAGACGGTC AGCCTTTCTC AAACTGGTGG GACAACGGGA GCAATCAAGT 
AGCTTTCGGT CGCGGCGACA GAGGCTTCAT TGTCTTTAAT AATGATGACT GGTATATG A A 
TGTCGATTTG CAAACTGGTC TGCCTGCTGG AACCTACTGC GATGTTATTT CTGGACAAAA 
GGAAGGCAGT GCGTGTACTG GAAAGCAGGT GTACGTTTCC TCGGATGGAA AGGCCAATTT 
CCAGATTAGT AACAGCGATG AAGATCCATT TGTTGCAATT CACGTTGATG CCA AGITATA 
AGCTT-3' 
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5 ' - AAGCTTGT AA ACAGCTGGGG AGCATGGCGA ACAGCCAGGC AATTAATTGA GATTATTAAT 
TGTACGAAAT ATATATGATG AGATTATAAA CACACAACAC TTTTATTCGC AAGGGATGAT 
AAGAATCTAA TATATATATT ATCTGGGCTT CAAAGCATTG ATTTTATTTA TTGAGTCAAG 
AGGGAAATTT ATTTTCTTGT TATTCTCTGT CCAGGTCTAA AGTCCCGAGC GGTGAGGCTA 
TCTATTGATT TGGACATTCC AATCGAATAC AAAACAGAGA TACAGAAATT TGCGAAAAAA 
TTTGATAACA ATCGTGGATT TTACGAATTA GACAAATTGA TATGTGCTTG CTAATTGATG 
TGGCATGAAA TAAGAAATTT ATAAGGACGT TTTCAAGTGC TTCTATTTTA AACATTCAGG 
ATTTTTTTTT AAAGCAGACA GCTTTCAACA GGTTTGATGA GAATTTGAAT ATTGATTGTT 
GACTTTAGCT ATACATAAAT CACACCTCAT CCACCCATTG TGGTATCCTT CGAAGGACTT 
GGGAACTGGA TCCTCTAGAA GGCGCGCC-3' 
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